THE PSYCHOLOGICAL REVIEW


DISCRIMINATION OF CUES IN MAZES: A RESOLUTION OF
THE “PLACE-VS.-RESPONSE” QUESTION ¹


FRANK RESTLE ²


Center for Advanced Study in the Behavioral Sciences 


Whether rats in mazes learn turning 
responses or places is a question which 
has often been subjected to experimen- 
tal test. Data from different experiments 
conflict, and attempts at a definitive an- 
swer seem only to add to the confu- 
sion. This paper will defend the thesis 
that the place-vs.-response question is 
wrongly formulated, and that the data 
which fail to decide between place and 
response learning give an unequivocal 
answer to a question properly stated. 

The place-vs.-response question has 
been approached largely through the use 
of the T maze, rotated in its visual sur- 
round. A typical arrangement show- 
ing runs reinforced in “place” and “re- 
sponse” learning is shown in Fig. 1, 
along with the fixed-maze problem in 
which both place and response may be 
learned. In both place learning and re- 
sponse learning, the maze is rotated on 
alternate trials at random. The place 
learner is always to go to the same place 
in the room, responding consistently to 
extra-maze cues but making different 
turns on different trials. The response 
learner makes the same turn on all trials, 
going to different places. One supposed
test of whether place or response
learning is dominant is to compare rates
of learning on these two problems.
Another test is to train animals with
the maze in a fixed position (place +
response learning), and then rotate the
maze for a test trial. The animal can
now either make the same turn he has
learned or go to the same place he has
been going to, but not both. In this
direct opposition experiment, the
relative number of animals taking each
choice is a test of the relative
dominance of place and response.

¹ Dr. Richard L. Solomon suggested and
drafted part of this paper (see footnote 3).
His guidance and help are gratefully
acknowledged.

² Now at Michigan State University.

In 1946, Tolman, Ritchie, and Kalish 
(23) proposed that in such a situation 
place learning is more natural and primi- 
tive for the rat than response learning, 
and place will dominate response in all 
tests. Their experiment supported the 
hypothesis, but some later repetitions 
have found response dominating place, 
or have found no difference. 

An answer to these apparent contra- 
dictions is here sought by assuming that 
maze running depends on a multiplicity 
of cues, and that the rat learns differ- 
ential responses to relevant cues in a 
maze just as he would in a
discrimination box. A theory based on
learning in the discrimination box will
be applied to the maze situation. Since
the discrimination theory to be used
(15) is very close to earlier theories
of the effect of sensory input on
learning in mazes, theories which
antedate the place-vs.-response
controversy, it is useful to place the
recent studies within their historical
context and analyze all the data at once.

FIG. 1. Runs reinforced in place, response,
and place + response (fixed maze) learning.
Only one T maze is present on a given trial.


MULTIPLE CUE THEORY


Hunter (9, 10) and Honzik (8), 
among others, have proposed that learning
and performance in mazes depend
upon many cues, in all available modali- 
ties and from all sources. Depending on 
what stimuli are available to the rat, 
maze learning may depend on visual, 
auditory, olfactory, tactual, and kines- 
thetic stimuli. Visual, auditory, and 
olfactory stimuli may arise from within 
the maze or outside. The cues used by 
rats may be not only the specific physi- 
cal stimuli but also patterns or arrange- 
ments of stimuli, so that changes which 
preserve the arrangement may not dis- 
rupt performance. 

When a portion of the cues are re- 
moved by surgery or screening, or made 
irrelevant by interchanging maze units, 
rotating the maze, etc., learning of the 
maze is retarded, but mazes are often 
perfectly solved in the presence of known 
irrelevant cues. 


The earlier formulation of multiple cue 
theory is somewhat refined by consider- 
ing more recent theory derived from dis- 
crimination learning (15). This newer 
theory distinguishes between relevant 
cues, which bear a regular relationship 
to the correct path, and irrelevant cues 
which are not consistently related to the 
correct response. 

A cue with a constant or predictable 
relationship to the true path will in the 
course of learning become “conditioned” 
to the correct response. All such cues 
will eventually be learned, and all will 
play a part in performance. If a certain
cue is irrelevant, bearing a changing
and unpredictable relationship to the 
correct response, it cannot be the basis 
of learning and is only a distractor. 
Such cues will become “adapted” during 
learning, and eventually will play no 
part in performance. 

The rate of learning depends directly 
on the proportion of relevant, usable 
cues in the total set available. 

When learning has been based on a 
variety of relevant cues and then some 
of these cues are scrambled and made 
irrelevant, the amount of disturbance 
reflects the relative importance of the 
newly changed cues. Though irrelevant 
cues will eventually be adapted, making 
important cues irrelevant reduces the 
proportion of relevant cues and thus 
retards learning. Disruption due to 
scrambling of learned relevant cues is 
attributed to the fact that such cues are 
not adapted, having previously been 
relevant. Recovery from the disturb- 
ance results from progressive adaptation 
of the scrambled cues. 

If the learning program involves con- 
stantly introducing new cues which have 
not been present before, these new cues 
will always disturb performance, since 
they cannot have been attached to the 
correct response and they cannot have 
been adapted. 
In an ordinary multiple-unit maze,
the untrained rat responds erratically 
early in training, producing different 
kinesthetic stimuli on different trials. 
As a result, such stimuli are not con- 
sistently related to the true path, and 
alone they cannot be the basis of rapid 
learning. As performance based on 
other cues improves, the animal makes 
more nearly the same movement each 
trial, producing for himself a regular 
pattern of kinesthetic stimuli which can 
serve as partial basis for the learned 
habit. Highly skilled performance thus 
depends to some degree on kinesthesis, 
though such cues do not alone mediate 
learning to a significant degree. 

The above application of the theory 
of discrimination learning to mazes does 
not differ in important respects from the 
assumptions and conclusions of Hunter 
and Honzik. The main refinement is in 
attributing the rate of learning to the 
proportion of relevant cues. The ear- 
lier writers do not make just this state- 
ment, though their statements may be 
interpreted to this effect. 

No attempt will be made in this paper 
to apply the specific quantitative for- 
mulation of discrimination learning to 
multiple-unit maze learning. The effects 
peculiar to the serial character of maze 
learning are not reflected in discrimi- 
nation theory. When a single-unit T 
maze is used, as in the recent place-re- 
sponse experiments, discrimination the- 
ory should apply exactly and quantita- 
tive predictions should be correct. 

The multiple cue theory can be com- 
pared with the results of studies using 
multiple-unit mazes, with the under- 
standing that specific quantitative values 
(error scores, trials to learn, etc.) de- 
pend both on the discrimination of cues 
and on serial patterns. An understand- 
ing of these earlier studies of maze cues 
is essential in gaining a clear insight into 
the place-vs.-response controversy. 


THE HISTORICAL CONTEXT ³


Studies of the sensory control of maze 
behavior of rats have been reviewed in 
detail by Munn (14), so only a broad 
summary will be given here, along with 
discussions of the main points. 

Early studies of the role of sensory 
processes in maze behavior were moti- 
vated by the early conclusion by Small 
(1901) and by Carr and Watson (1908) 
that in complex mazes rats form kines- 
thetically controlled habits which be- 
come “automatic.” This hypothesis is 
that kinesthetic cues, arising from one 
response in the maze, serve as the main 
cues to the next response in the se- 
quence. Each response is associated 
with the previous one, and other sensory 
input becomes unimportant in maintain- 
ing performance. This conclusion was 
based on the erroneous assumption that 
if one sort of cue is important, others 
must be unimportant. The technique 
was essentially one of eliminating a 
single sense modality at a time, in each 
case observing that performance remains 
essentially intact. The only modality 
not disturbed was kinesthesis, which was 
assumed to be “the” crucial one. Hunter 
(11) has shown the flaw in this line of 
argument. 

The “kinesthetic” hypothesis inspired 
a series of experiments designed to prove 
or disprove it. These experiments un- 
covered a wealth of information about 
the cues which in fact control behavior 
in various mazes. Fundamentally, the 
data were analyzed to determine the 
relative effectiveness of (a) kinesthetic 
stimuli, (b) intra-maze stimuli, such as
visual, olfactory, and tactual stimuli
from the maze itself, and (c) extra-maze
stimuli, such as visual or auditory
stimuli from the room containing the
maze. Intra- and extra-maze stimuli
were broken down into components due
to different sense modalities, again with
the intention of evaluating each.

³ The bearing of these studies on the
place-vs.-response question was pointed
out to the author by Dr. R. L. Solomon.
This section and the beginning of the
next section are based in part on a
personal communication from him.

The methods used in assessing the role 
of sensory events in maze behavior were 
as follows: (a) surgical interference with 
receptor organs and neural pathways; 
(b) elimination of stimuli from the
intra-maze or extra-maze environment; 
(c) introduction of distinctive stimuli to 
the environment; and (d) the con- 
trolled rearrangement of intra-maze and 
extra-maze stimuli. 

In general, the results of these experi- 
ments were as follows: 

(1) Simple alternation mazes can be 
learned by the rat on the basis of kines- 
thetic cues alone, but more complex 
mazes cannot be learned in any reason- 
able number of trials without the aid of 
visual, olfactory, or auditory cues.
Removal of kinesthetic cues by surgical
means does not greatly affect maze
performance if intra- or extra-maze cues are
left intact. However, once a maze per- 
formance is perfected on the basis of 
intra- or extra-maze cues, removal of 
such cues does not destroy the perform- 
ance completely. As one would expect 
from the multiple cue theory, kinesthetic 
cues are relevant and conditioned to the 
correct response only when a regular re- 
lationship between the last response and 
the next correct response is established. 
Such a relationship exists in a simple 
alternation maze early enough for learn- 
ing to take place. In more complex 
mazes, usable kinesthetic cues exist only 
after the maze is learned on some other 
basis. Since during learning kinesthetic 
cues are mostly irrelevant anyway, their 
removal does not retard learning. 

(2) Mazes can ordinarily be learned 
on the basis of intra-maze cues alone, 
even if extra-maze cues are made irrele- 
vant by rotating the maze in the room. 
Such rotation retards learning, however, 
especially if the maze is elevated and
the room contains conspicuous visual 
cues. If the maze is kept in place in the 
room and its spatial arrangement is pre- 
served while units of the maze are inter- 
changed to make intra-maze stimuli ir- 
relevant, rats can still solve the prob- 
lem. Again, interchanging of stimuli 
retards learning. In many mazes, it 
should be noted, all blinds have a com- 
mon visual appearance, being shorter 
than true alleys and ending within sight 
of the rat at the choice point. Inter- 
changing units does not make such 
stimuli irrelevant. But even if the spa- 
tial pattern of the maze is scrambled, if 
the goal is in a fixed position relative to 
the extra-maze environment the rat can 
learn to run to the goal box without fol- 
lowing alleys which lead away from the 
food. Thus, intra-maze and extra-maze 
cues are each separately capable of sus- 
taining learning and performance in the 
maze. With one type scrambled, per- 
fect performance can often be attained 
based on the other type, indicating that 
irrelevant cues are eventually disre- 
garded. Retardation of learning due to 
removal or scrambling of important cues 
is consistently observed. 

(3) In elevated mazes, where most of 
the visual field arises from outside the 
maze, extra-maze cues are usually more 
important than intra-maze cues. If the 
maze is enclosed in a homogeneous 
room, however, extra-maze cues are rele- 
gated to a minor role. Intra-maze cues 
are generally more important in alley or 
tunnel mazes, where the rat has at best 
an obstructed view of the outside. In 
a unidirectional maze, extra-maze cues 
are more important than they are in a 
maze which requires the rat to run in 
many different directions. The results 
support the idea that either type of cue 
may be the more important, depending 
on the relative amount of relevant stimu- 
lation stemming from each source. 

(4) In general, two ways of assessing 
the importance of a certain cue give
comparable estimates. One method is to 
scramble the cue during learning, as- 
sessing its importance by the relative 
retardation in the rate of learning. The 
other method is to have the rat learn 
the maze with the cue relevant, then 
scramble it. The amount of disruption 
of the perfected habit indicates the im- 
portance of the cue. To an approxima- 
tion, these two methods rank various 
classes of cues the same way. Remov- 
ing a type of cue by depriving the ani- 
mal of necessary receptors, as by blind- 
ing or deafening the animal or rendering 
it anosmic by surgical interference, is 
not the same thing as scrambling cues 
experimentally. Blinding, for example, 
removes both relevant and irrelevant 
visual cues, whereas rotating the maze 
makes otherwise relevant extra-maze cues 
irrelevant, and does not affect intra-maze 
visual cues. Thus, exact comparisons 
between the effects of scrambling and 
the effects of surgery cannot usually be 
made. 

These findings suggest that kinesthetic 
cues are unimportant in the learning of 
complex mazes, but that both intra- and 
extra-maze cues are important. The 
relative importance of various types of 
cues depends on the maze and its sur- 
round, for rats seem to use various cues 
proportionally as those cues are avail- 
able and relevant. Rats seem capable 
of overcoming the distraction of irrele- 
vant cues. 

The results are consistent with the 
conception that maze learning and per- 
formance depend on multiple cues, and 
that such cues are discriminated and re- 
sponses learned to them in accord with 
the theory of discrimination learning. 


“PLACE” AND “RESPONSE” IN
SINGLE-UNIT T MAZES


In the light of the earlier extensive 
studies of the sensory basis of maze
running, studies of place and response in T
mazes appear as comparisons of extra- 
maze and kinesthetic cues. The Tol- 
man-Ritchie-Kalish hypothesis that rats 
learn “places” rather than “responses” 
means, in the earlier terminology, that 
extra-maze cues are more important than 
kinesthetic cues. Since in earlier studies 
kinesthetic cues were found barely suffi- 
cient to permit any learning at all, 
whereas extra-maze visual cues bulk 
large in importance when they are avail- 
able, it would appear that the dominance 
of “place” learning was indubitable, and 
the Tolman-Ritchie-Kalish experiment 
redundant. 

It should be noted that Tolman’s 
“place” formulation is inferior to ear- 
lier formulations. Tolman does not 
specify what cues are thought to make 
up “place” indications, and he does not 
identify “response” learning as learning 
based solely on kinesthetic cues. Though 
intra-maze cues do not seem to be “re- 
sponse” cues in Tolman’s sense, it is not 
clear whether they are “place” cues or 
not. 

Though the hypothesis that place 
learning will dominate response learn- 
ing seems to follow in a general way 
from Tolman’s concept of cognitive 
maps, his position was not really contro- 
versial because stimulus-response theo- 
rists did not believe response learning 
would be easier than place learning. 
The early Carr-Watson hypothesis, that 
maze habits are mainly controlled by 
kinesthesis, was by 1946 buried under 
conflicting evidence, and S-R theories
stated that the stimulus components of 
the maze habits were quite likely to be 
extra-maze visual cues, especially if the 
maze is elevated and in a room full of 
such cues. 

From these considerations, one should 
expect that the dominance of place 
learning is a foregone conclusion. The 
existence of a “controversy” is itself a
surprise. In order to show the nature
of this controversy it is necessary first
to consider how place-response
experiments have been conducted and what
results have been obtained. Since this
recent literature has not received a de- 
finitive review, a relatively thorough 
analysis is presented here. 


Tests of Place-vs.-Response Dominance 


We first consider whether place learn- 
ing in fact dominates response learning 
in the single-unit maze. Ten studies 
have compared the rates of place and 
response learning in single-unit T mazes 
or slight modifications thereof. Of these, 
seven found place learning faster (1, 3, 
5, 22, 23, 24, 25), and two found re- 
sponse learning faster (7, 20), while one 
found that either could be faster de- 
pending on the intertrial interval (21). 
In direct opposition tests (rotation of 
the maze after the habit is learned on 
a fixed maze), one study showed that 
either place or response could be su- 
perior depending on the shape of the 
approach stem (18), another showed 
that either could be dominant depend- 
ing on differential cues (27), and two 
others found response tendencies
overriding place tendencies (2, 12). A
modification of the opposition test consists
of setting up conditions in which the 
animal alternates. Several studies have 
asked whether an animal alternates re- 
sponse or stimulus (place) character- 
istics (4, 6, 13, 26), and the answer 
has always been that place alternation 
is stronger than response alternation. 
These studies used enclosed alley mazes, 
“place” cues being especially distinctive 
intra-maze cues. 

In summary, if we merely count titles, 
the impression is received that place 
tendencies are usually learned faster 
than responses, and that they are some- 
times stronger in opposition tests, espe- 
cially those depending on alternation. 
One certainly cannot draw the
conclusion that place learning is always
dominant, though, for response tendencies
dominate in at least some conditions of 
seven different studies. 


Conditions Associated With Place and 
Response Dominance 


Since neither place nor response is 
uniformly dominant, we may consider 
the experimental conditions which make 
one or the other stronger. 

Several writers (1, 7, 21, 25) have 
suggested that the relative dominance of 
place depends on the amount of differ- 
ential visual stimulation. This is, of 
course, the position taken by earlier 
writers on general maze learning, though 
place-vs.-response studies are not usu- 
ally analyzed in terms of this variable. 
Counting heads in the experimental re- 
ports, we see that the use of a homoge- 
neous visual surround (a dome or en- 
closure, usually made of muslin, which 
prevents the animal from discerning any 
uncontrolled stimuli from the room about 
him) greatly predisposes rats to learn re- 
sponses instead of places. All four ex- 
periments using such domes report re- 
sponse dominance (2, 7, 20, 21), though 
one (21) showed that response domi- 
nance could be neutralized by massing 
trials. The only other cases of response 
dominance reported (12, 18) were in 
plain rooms under low illumination, 
which may be thought of as approxima- 
tions to dome-type enclosures. All other 
studies showing place dominance were 
done in open rooms (1, 3, 5, 22, 23, 24, 
25) or in alley mazes with strong dif- 
ferential cues (6, 13, 26). One study 
showed that when the two places con- 
trast sharply in illumination, place domi- 
nates, whereas when the illumination is 
more nearly equal, response tendencies 
dominate. 

In summary, place tendencies domi- 
nate when visual stimuli at the two ends 
of the maze are very unlike, and
response tendencies dominate when such
stimuli are relatively alike. In every 
case the domination seems to be quanti- 
tative—both place and response tenden- 
cies exist, but one is stronger than the 
other depending on stimulus conditions. 

By inspecting the experimental reports 
we can get some idea of what consti- 
tutes a strong place cue for the rat. 
The most dramatic place dominance was 
found in two studies (17, 23) in which 
rat cages were located to one side, 
nearer one goal than the other. In one 
of these studies (17) it was found that 
the rats would not give up responding 
to one place in the room, despite a num- 
ber of controls, until the cages were 
moved, following which almost all the 
rats reversed. Stimuli such as lights had 
only a slight effect on performance, com- 
pared with the rat cages. 

When rat cages are placed directly 
behind the starting point and are thus 
not available as place cues, or when the 
room is devoid of cages, such visual 
stimuli as windows (in daylight studies) 
give rise to strong place preferences (1, 
3), as do well-lighted rooms with many 
small objects in them (5, 25). Some- 
what less striking but still consistent place 
dominance was shown in two studies in 
which one wall of a plain room (the 
wall behind one goal), was moderately 
well illuminated, the other wall not be- 
ing illuminated at all (22, 24). The 
power of room cues can be estimated by 
noting the quick learning of a successive 
discrimination between rooms (19).
Attempts to give differential cues inside
domes by illuminating a 10-inch disc 
behind one goal with a 7.5-watt lamp 
(2, 12) or using 7.5-watt lamps them- 
selves as cues, with (7) or without (20) 
overhead illumination, did not lead to 
place dominance. Symmetrical overhead 
illumination in an empty room produced 
response dominance with spaced practice 
(21). A small lamp on the floor below
one goal, pointed to throw long shadows 
on the floor, gave slightly more place 
than response tendency (18). Ratio of 
illuminations rather than difference seems 
to be a crucial variable (27). 

In summary, rat cages are very strong 
place stimuli, windows or objects in a 
well illuminated room rank next in 
power, and fairly strong differential 
lighting of walls of a plain room is also 
effective. Discs lit by 7.5-watt lamps, 
or such lamps themselves, are not usu- 
ally enough to make place dominant 
over response in an otherwise
homogeneous setting.

Some other variables are associated to 
some degree with place and response 
dominance. For example, all
experiments using pigmented rats (4, 22, 23,
24) show place dominance, except for 
one (12) which showed cases of both 
place and slight response dominance un- 
der low illumination. But white rats 
also show place dominance under good 
illumination (1, 3, 5, 25), so we may 
reason that it just happened that ex- 
perimenters who intended to use strong 
visual cues also sometimes used pig- 
mented rats to take advantage of their 
superior vision. The pigmentation of 
the rat cannot be shown to have any 
other effect on place and response domi- 
nance. 

The noncorrection procedure (picking 
up the rat after an error) seems to lead 
to better response learning than the cor- 
rection procedure (allowing the rat to 
find the food by retracing after an 
error). Using the place-response stud- 
ies, the effects of correction cannot be 
isolated from the visual cue effects men- 
tioned above. 

One study (21) indicates that re- 
sponse learning is slowed by massing 
trials. Little more about massing ef- 
fects in these studies is known, but most 
of the studies employ at least moderate
trial spacing. 

There seems to be no interesting re- 
lation between place-vs.-response domi- 
nance and either the ages of rats used 
or the size of the maze employed. The 
slight tendency for smaller mazes to go 
with response dominance is more than 
explained by the tendency of experi- 
menters using domes to use slightly 
smaller mazes for convenience. 


Evaluation of the Place-vs.-Response 
Controversy 


The details of the single-unit T maze 
experiments quite clearly indicate that 
there is nothing in the nature of a rat 
which makes it a “place” or “response” 
learner. The main factor determining 
the outcome of place-vs.-response ex- 
periments is the amount of extra-maze 
visual stimulation which differentiates 
the region around and behind one goal
from the region around and behind the
other. Such visual cues are relevant in
place learning and irrelevant in response 
learning. 

It seems reasonable to conclude that 
the place-vs.-response controversy, which 
seemed ill formulated when compared 
above with earlier maze studies, gives 
a distorted and confusing interpretation 
of the experiments designed to settle it. 

It should be remarked that in
single-unit mazes, kinesthetic cues appear
sufficient to support quite rapid learning.
This conflicts with Honzik’s conclusion 
that “when all other avenues of stimula- 
tion are destroyed, kinaesthesis is help- 
less” (8, p. 56). Honzik’s statement, if 
taken as applying generally to all mazes 
including single-unit ones, would be in- 
correct. The importance of a sense mo- 
dality depends on the richness and rele- 
vance of stimuli in that modality which 
exist in the maze situation. In single- 
unit mazes, relevant kinesthetic stimuli 
apparently abound. In Honzik’s
14-unit maze, animals do not make regular
enough runs to give themselves a con- 
stant set of kinesthetic stimuli on which 
to build accurate performance. One 
may also consider Hunter’s point that 
in complex mazes, with left and right 
turns required in irregular order, simple 
kinesthetic cues are irrelevant. Attempts 
to state the relative importance of sense 
modalities or of intra- and extra-maze 
cues in general, for all mazes, are akin 
to the “place-vs.-response” hypothesis in 
that they fail to incorporate the most 
important variable, the stimulus situa- 
tion presented to the animal. Such 
attempts are, accordingly, doomed to 
failure. 


PREDICTION OF T-MAZE DATA BY THE
QUANTITATIVE THEORY OF
DISCRIMINATION LEARNING


The single-unit T maze, being devoid 
of the serial characteristics of more com- 
plex mazes, may be thought of as a 
kind of discrimination-learning
apparatus. When the T maze is rotated on
random trials as in place-response
experiments, extra-maze (place) and
kinesthetic (response) cues are
uncorrelated.⁴ In place learning (see Fig. 1)
place cues are relevant, and all others 
including response cues are irrelevant. 
In response learning, response cues are 
relevant and all others including place 
cues are irrelevant. In place + response 
learning, with a fixed maze, both place 
and response cues are relevant. 
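
The relevance assignments in the paragraph above can be sketched numerically. The following is a minimal illustration, assuming that cues fall into three classes with fixed weights. The class names, the weights in `CUE_WEIGHTS`, and the function `proportion_relevant` are hypothetical and are not estimates from any experiment discussed here.

```python
# Hypothetical cue weights, chosen only for illustration.
# "other" stands for cues that are irrelevant in all three problems.
CUE_WEIGHTS = {"place": 3.0, "response": 2.0, "other": 5.0}

# Cue classes that are relevant in each problem, following the text.
RELEVANT = {
    "place": {"place"},
    "response": {"response"},
    "place+response": {"place", "response"},
}


def proportion_relevant(problem, weights=CUE_WEIGHTS):
    """Return the share of total cue weight that is relevant in `problem`."""
    total = sum(weights.values())
    relevant = sum(w for name, w in weights.items() if name in RELEVANT[problem])
    return relevant / total


for problem in RELEVANT:
    print(problem, round(proportion_relevant(problem), 3))
```

Under these assumed weights the fixed-maze proportion equals the sum of the place and response proportions, because the same total set of cues is present in all three problems.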

We may entertain the hypothesis of
cue-additivity: that the set of cues
relevant in place + response learning is
simply the sum of the place and response
cues. This hypothesis can be tested
quantitatively, using a theory of
discrimination learning (15). In this
theory, the rate of learning is set equal
to the proportion of relevant cues, and
the single number is called θ. The
learning curve (or statistics such as
total errors to mastery, total errors in
n trials, trials-to-criterion, etc.) may
be used to estimate θ; and, conversely,
given θ the learning curve or any of its
statistics can be computed in advance of
experimentation. The equations involved
and the methods of computation will not
be repeated here.

⁴ If the whole maze is rotated as a unit,
intra-maze cues are relevant in response
learning and are confounded with
kinesthetic cues. If the starting stem is
moved but the cross-arm is left in place,
intra-maze cues are relevant in place
learning, and are confounded with
extra-maze cues. We shall assume that
intra-maze cues combine additively with
the cues they are correlated with, and
shall not specify what happens to them.

Galanter and Shaw (5) used three
groups of rats trained in the same
apparatus and surround, under conditions
of place learning, response learning, and
fixed-maze or place + response learning,
respectively. Using median
trials-to-criterion ⁵ as an index of typical
performance, the proportion of relevant
cues in each problem was computed, using
the equations of discrimination-learning
theory. Theoretically, the proportion of
relevant cues in the place + response
problem should be the sum of the
proportions in the other problems:
$\theta_{P+R} = \theta_P + \theta_R$. The
results of the computations to check this
hypothesis are shown in Table 1. Place +
response performance is predicted using
only data from the place and response
groups. Inspection of Table 1 indicates
that the prediction is relatively
accurate, although, since subjects in the
place + response group ranged in
trials-to-criterion from 0 to 3, the
prediction is at the edge of the obtained
distribution of scores.⁶

TABLE 1

MEDIAN TRIALS TO 10-OUT-OF-10 CRITERION
AND PROPORTION OF RELEVANT CUES (θ)
IN THE GALANTER-SHAW EXPERIMENT

                      Observed       Predicted
Relevant Cues        Trials   θ     Trials   θ
Place                 5.0    .34      —      —
Response                     .10      —      —
Place + Response      2.0    .58     3.2    .44

⁵ Since retracing and “false-start” responses
were counted as errors, animals had a higher
probability of making an error than a correct
response at the beginning of training. This
bias in favor of errors has relatively little
effect on trials-to-criterion, which is
accordingly used as an index of performance.
The parameter θ is estimated from
trials-to-criterion by an approximation to the
maximum likelihood method.

Scharlock (20) used a maze in a dome
with either one light behind one goal or 
lights behind both. If there was one 
light it was the only source of differ- 
ential place cues, so if there were two 
lights there were no place cues. Schar- 
lock ran place, response, and place + re- 
sponse groups with one light. He also 
had a control group which made no 
progress on place learning with both 
lights—an expected result confirming 
that with a light behind each goal there 
were no place cues. One other group, 
which we may call “response-minus- 
place,” learned a response with both 
lights on, and thus with no place cues 
available. Here again we can predict 
that the place + response group will 
yield a learning rate, 6p,z, which is the 
sum of the rates of the place and re- 
sponse groups, 6p + 6g. In addition, 
the learning rate of the response-minus- 
place group should be faster than that 
of the response group, because of the 


6 Dr. Galanter, in a personal communication, 
noted that the place + response group was run 
after the other parts of the experiment were 
completed. In Galanter’s opinion, the experi- 
menters were by this time somewhat more 
skillful, and the place + response group had 
an advantage. This might account for the 
discrepancy between prediction and observa- 
tion. 
TABLE 2

ERRORS IN 28 TRIALS AND PROPORTION OF
RELEVANT CUES (θ) IN THE SCHARLOCK
EXPERIMENT

                                      Observed               Predicted
Relevant Cues                      Errors       θ         Errors       θ

Place                           [illegible]   .216*         -          -
Response                            3.8       .296          -          -
Place learning, no place
  cues (2 lights)                  17.50      .000          -          -
Place + Response                    2.28      .445     [illegible] [illegible]
Response − Place: no place
  cues (2 lights)                   2.84      .366     [illegible] [illegible]

* Because of initial biases, the probability of correct
response on the first trial is estimated at about .375
for all groups. The correction is made in the fashion
shown in (16).



elimination of irrelevant place cues.
Since $\theta_P$ is the proportion of differential
place cues, it follows theoretically that
$\theta_{R-P} = \theta_R/(1 - \theta_P)$. The results of
computations to check these hypotheses
are shown in Table 2. Inspection of
Table 2 shows that the predictions are
quite accurate. The discrepancies between
predictions and observations are
not statistically significant.
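
The two relations used for the Scharlock data can be illustrated with a
short computation. What follows is only a sketch: the function names are
hypothetical, the ratio rule is read as the response proportion divided
by one minus the place proportion, and the inputs are the rounded
single-cue estimates quoted for the place and response groups (.216 and
.296). The printed values are what the formulas give for those inputs;
they are not copied from Table 2.

```python
def predict_place_plus_response(theta_place, theta_response):
    """Additive rule: theta_{P+R} = theta_P + theta_R."""
    return theta_place + theta_response


def predict_response_minus_place(theta_place, theta_response):
    """Ratio rule for removing place cues: theta_{R-P} = theta_R / (1 - theta_P)."""
    if not 0.0 <= theta_place < 1.0:
        raise ValueError("theta_place must lie in [0, 1)")
    return theta_response / (1.0 - theta_place)


# Rounded single-cue estimates quoted for the Scharlock experiment.
theta_p, theta_r = 0.216, 0.296

sum_rule = predict_place_plus_response(theta_p, theta_r)
ratio_rule = predict_response_minus_place(theta_p, theta_r)

print(f"place + response:     {sum_rule:.3f}")
print(f"response minus place: {ratio_rule:.3f}")
```

Removing the place cues raises the response proportion because the same
response cues are now a larger share of a smaller total.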

A third experiment by Blodgett, Mc- 
Cutchan, and Mathews (3) separates 
location and direction cues. The rat 
may approach the same location (for 
instance, the center of the room) from 
either of two directions if the maze is 
shifted appropriately. Location and di- 
rection are usually lumped as place cues, 
but in this experiment they are sepa- 
rated. Seven groups constitute the ex- 
periment, with one group learning each 
problem possible: location, direction, re- 
sponse, all combinations of two relevant, 
and the combination of all three rele- 
vant (fixed maze learning). 

The data were reported in terms of
“cycles,” pairs of trials, which contained
at least one error. From this it is not
possible to make good estimates of $\theta$,
but an effort has been made to attain
fair approximations. The estimates, and
the corresponding predictions made by
adding $\theta$-values of problems with fewer
relevant cues, are shown in Table 3.
The results seem quite encouraging, under
the circumstances.
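
The additive predictions for the seven-group design can be generated
mechanically from the three single-cue estimates. The sketch below
assumes the single-cue values are .02 (location), .11 (direction), and
.08 (response), as read from Table 3; the dictionary and function names
are hypothetical. Because the inputs are two-decimal estimates, a sum
may differ from a tabled prediction in the last digit.

```python
from itertools import combinations

# Single-cue estimates of theta, as read from Table 3 (two-decimal values).
single_cue_theta = {"location": 0.02, "direction": 0.11, "response": 0.08}


def predicted_theta(cues):
    """Additive rule: theta for a compound problem is the sum of its parts."""
    return sum(single_cue_theta[cue] for cue in cues)


# Build predictions for every problem with two or three relevant cue classes.
predictions = {}
for size in (2, 3):
    for combo in combinations(single_cue_theta, size):
        predictions[" + ".join(combo)] = round(predicted_theta(combo), 2)

for problem, theta in predictions.items():
    print(f"{problem}: {theta:.2f}")
```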


SUMMARY AND CONCLUSIONS 


Consideration of early studies of the 
sensory basis of maze learning, and re- 
view of place-vs.-response experiments, 
indicate that: 

(1) There is nothing in the nature of 
a rat which makes it a “place” learner, 
or a “response” learner. A rat in a maze 
will use all relevant cues, and the im- 
portance of any class of cues depends 
on the amount of relevant stimulation 
provided as well as the sensory capaci- 
ties of the animal. In place-response 
experiments, the importance of place 
cues depends on the amount of differ- 
ential extra-maze stimulation. 

(2) A multiple-cue theory of maze 
learning is successful in comprehending 
the major results of experiments using 
complex mazes, and the detailed re- 
sults of place-response experiments using 
single-unit T mazes. 

(3) Useful refinements of classical 
multiple-cue theory were taken from 
discrimination-learning theory. These 
are that irrelevant cues are adapted 
during learning, and that the rate of 


TABLE 3

PROPORTION OF RELEVANT CUES (θ)
ESTIMATED AND PREDICTED IN
THE BLODGETT, McCUTCHAN,
AND MATHEWS EXPERIMENT

Relevant Cues                        Estimated θ    Predicted θ

Location                                 .02             -
Direction                                .11             -
Response                                 .08             -
Location + Direction                     .13            .13
Location + Response                      .10            .10
Direction + Response                     .13            .18
Location + Direction + Response          .18            .21

learning depends on the proportion of
relevant cues.

(4) Quantitative analysis of the re- 
sults of certain place-response experi- 
ments indicates that place and response 
cues combine additively in the place + 
response (fixed maze) problem. 

The writer’s general conclusion is that 
further “definitive” studies of the place- 
vs.-response controversy, to prove that 
rats are by nature either place or re- 
sponse learners, would be fruitless since 
the issue is incorrectly drawn. How- 


ever, use of the T maze to analyze the 
stimuli in maze learning holds promise 
of yielding a consistent quantitative ac- 
count of how rats find their way. Such 
studies can build on the earlier work on 
more complex mazes. 


REFERENCES 


1. Blodgett, H. C., & McCutchan, K. Place versus response learning
   in the simple T-maze. J. exp. Psychol., 1947, 37, 412-422.

2. Blodgett, H. C., & McCutchan, K. Relative strength of place and
   response learning in the T maze. J. comp. physiol. Psychol.,
   1948, 41, 17-24.

3. Blodgett, H. C., McCutchan, K., & Mathews, R. Spatial learning in
   the T-maze: the influence of direction, turn, and food location.
   J. exp. Psychol., 1949, 39, 800-809.

4. Galanter, E. H. Place and response learning: learning to
   alternate. J. comp. physiol. Psychol., 1955, 49, 17-18.

5. Galanter, E. H., & Shaw, W. A. “Cue” vs. “reactive inhibition” in
   place and response learning. J. comp. physiol. Psychol., 1954,
   47, 395-398.

6. Glanzer, M. The role of stimulus satiation in response
   alternation. J. exp. Psychol., 1953, 45, 387-393.

7. Hill, C. W., & Thune, L. E. Place and response learning in the
   white rat under simplified and mutually isolated conditions.
   J. exp. Psychol., 1952, 43, 289-297.

8. Honzik, C. H. The sensory basis of maze learning in rats.
   Comp. Psychol. Monogr., 1936, 13, 1-113.

9. Hunter, W. S. The sensory control of the maze habit in the white
   rat. J. genet. Psychol., 1929, 36, 505-537.

10. Hunter, W. S. A further consideration of the sensory control of
    the maze habit in the white rat. J. genet. Psychol., 1930, 38,
    3-19.

11. Hunter, W. S. A consideration of Lashley’s theory of the
    equipotentiality of cerebral action. J. gen. Psychol., 1930, 3,
    455-468.

12. McCutchan, K., Rethlingshafer, D., & Nichols, J. W. The role of
    response and place learning under alternating hunger and thirst
    drives. J. comp. physiol. Psychol., 1951, 44, 269-275.

13. Montgomery, K. C. A test of two explanations of spontaneous
    alternation. J. comp. physiol. Psychol., 1952, 45, 287-293.

14. Munn, N. L. Handbook of psychological research on the rat.
    New York: Houghton Mifflin, 1950.

15. Restle, F. A theory of discrimination learning. Psychol. Rev.,
    1955, 62, 11-19.

16. Restle, F. Theory of selective learning with probable
    reinforcements. Psychol. Rev., 1957, 64, 182-191.

17. Ritchie, B. F. Studies in spatial learning: III. Two paths to the
    same location and two paths to different locations. J. exp.
    Psychol., 1947, 37, 25-38.

18. Ritchie, B. F., Aeschliman, B., & Peirce, P. Studies in spatial
    learning: VIII. Place performance and the acquisition of place
    dispositions. J. comp. physiol. Psychol., 1950, 43, 73-85.

19. Ritchie, B. F., Hay, A., & Hare, R. Studies in spatial learning:
    IX. A dispositional analysis of response performance. J. comp.
    physiol. Psychol., 1951, 44, 442-449.

20. Scharlock, D. P. The role of extramaze cues in place and response
    learning. J. exp. Psychol., 1955, 50, 249-254.

21. Thompson, M. E., & Thompson, J. P. Reactive inhibition as a
    factor in maze learning: II. The role of reactive inhibition in
    studies of place learning versus response learning. J. exp.
    Psychol., 1949, 39, 883-891.

22. Tolman, E. C., & Gleitman, H. Studies in spatial learning: VII.
    Place and response learning under different conditions of
    motivation. J. exp. Psychol., 1949, 39, 653-659.

23. Tolman, E. C., Ritchie, B. F., & Kalish, D. Studies in spatial
    learning: II. Place learning versus response learning. J. exp.
    Psychol., 1946, 35, 221-229.

24. Tolman, E. C., Ritchie, B. F., & Kalish, D. Studies in spatial
    learning: V. Response learning vs. place learning by the
    non-correction method. J. exp. Psychol., 1947, 37, 285-292.

25. Waddell, D., Gans, S., Kempner, P., & Williams, A. A comparison
    of place and response learning in very young rats. J. comp.
    physiol. Psychol., 1955, 48, 375-377.

26. Walker, E. L., Dember, W. N., Earl, R. W., & Karoly, A. J. Choice
    alternation: I. Stimulus vs. place vs. response. J. comp.
    physiol. Psychol., 1955, 48, 19-23.

27. Webb, W. B. A study in place and response learning as a
    discrimination behavior. J. comp. physiol. Psychol., 1951, 44,
    263-268.


(Received September 21, 1956) 





Psychological Review
Vol. 64, No. 4, 1957


LATENT EXTINCTION AND THE FRACTIONAL 
ANTICIPATORY RESPONSE MECHANISM* 


HOWARD MOLTZ 
I


Most students of learning would agree 
that in order for an organism to acquire 
an adaptive behavior pattern not only
must certain appropriate responses be
learned, but other inappropriate or nonadaptive
ones must be extinguished.²
It is not surprising, therefore, that the 
attempt to provide an explanation of 
the manner in which response extinction 
occurs is at present a strategic area of 
concern for current theories of learning. 
Of importance to such theories is a 
procedure, employed recently in several 
studies (11, 32, 36, 38), that was de- 
signed to demonstrate that the strength 
of an instrumental response can be 
weakened prior to its first nonrewarded 
performance. This procedure involves
placing experimental animals (rats), following
the termination of training, directly
into a now empty but previously
baited goal box for a period judged sufficient
to “inform” them of its present
nonrewarding character. The effect
upon response strength was observed 
when the subsequent performance of 
these Ss in approaching the empty goal 
location was compared with that of 
control animals which had not received 
this pre-extinction experience. The pro- 


1 The writer wishes to acknowledge with ap- 
preciation the helpful comments of his col- 
leagues, Professors Wayne Dennis, Elizabeth 
Fehrer, and David Raab. 

2 Extinction will be defined as the reduction 
in response strength that occurs following non- 
reinforcement. The mechanism mediating this 
reduction is left open, since in the opinion of 
the present author much more data are required
before a complete and empirically adequate
explanation of the extinction process
can be formulated.


cedure whereby an animal is introduced 
directly into an unbaited goal location 
after having acquired a response instru- 
mental in securing a reward object previ- 
ously contained in that location will be 
referred to as “latent extinction.” While 
some investigators have questioned the 
effectiveness of this procedure (9, 37), 
others (10, 11, 23, 29, 32, 38) have 
demonstrated conclusively, in several 
different learning situations, that latent 
extinction does produce a decrement in 
the strength of the response to be elimi- 
nated. The fact that latent extinction 
has proven effective in this respect has 
been considered embarrassing to the S-R 
reinforcement theorist, since the impli- 
cation appears to be that “. . . all that 
seems necessary for extinction is for the 
organism to perceive the absence of re- 
inforcement” (12, p. 62). It is the pur- 
pose of the present paper to attempt to 
interpret the effects of latent extinction 
in a manner consistent with an S-R 
theoretical approach and, on the basis 
of this interpretation, to suggest hypotheses
for further empirical study. It
should be emphasized that the term “latent
extinction” will be used throughout
to denote a particular experimental procedure
and not a process or a state of
the organism.

Latent extinction has been employed 
primarily in two essentially different 
situations. The situation we will here- 
after refer to as Type I was designed to 
measure the effect of latent extinction 
on the strength of a previously estab- 
lished instrumental response. The Type 
I situation involves training an animal 
to traverse a straight alley or acquire a 
position habit in a maze in order to obtain
a reward object. Following training,
the animal is placed directly into
the goal location in the absence of the 
reward object for a predetermined pe- 
riod (latent extinction) after which 
“regular” response extinction is admin- 
istered. The effect of latent extinction 
in reducing the strength of the in- 
strumental response is indexed by the 
number of trials required to reach an 
extinction criterion. An experiment re- 
ported recently by Deese (11) serves 
to illustrate the Type I procedure. 
Deese trained rats to a position habit 
in a U maze with food as the reward. 
Following the termination of training, 
the experimental Ss were placed in the 
now empty goal location for four 1-min. 
periods. When these Ss were subse- 
quently run to extinction, they yielded 
a significantly smaller proportion of cor- 
rect responses than control Ss that had 
not received the latent extinction experi- 
ence. Deese’s results indicate that a re- 
sponse can be weakened by placing S 
in the goal location without the pres- 
ence of the reward object and that 
therefore, in at least some learning
situations, performance is not a necessary
condition for extinction.

The procedure that we will hereafter 
refer to as Type II is concerned with 
the effect of latent extinction on the capacity
of secondary reward stimuli to
mediate the learning of a new response
in the absence of primary reinforcement.
The training employed in the Type II
situation involves the use of differential
reinforcement in a straight alley. By 
means of this technique, the cues in the 
baited goal box come to acquire sec- 
ondary reward value while those of an 
unbaited goal box of a different color do 
not. Following the termination of train- 
ing, the animal is given latent extinction 
which, as in the Type I situation, con- 
sists of introducing him directly into 
the positive goal box in the absence of 
the reward object. The two goal boxes 
are then placed on a T maze and the
number of choices to the side of the 
maze containing the positive goal box is 
taken as an index of the effect of latent 
extinction on the capacity of secondary 
reward stimuli to mediate new learning. 
The results of a recent experiment by 
Moltz and Maddi (33), to be discussed 
in detail below, attest to the efficacy of 
latent extinction in reducing the ac- 
quired reward value of previously neu- 
tral stimuli. 


II 


An expectancy theory of extinction 
(44) encounters little difficulty in pro- 
viding an interpretation of the effects of 
latent extinction, since it places no special
requirement on the response to be
eliminated. All that appears necessary
to decrease an animal’s readiness to respond
in accordance with an established
expectancy is to change the relationship 
between environmental objects, and to 
allow the animal to experience the new 
relationship on one or more occasions. 
Since the acquisition of a new cognitive 
pattern only requires that S, when mo- 
tivated (45), experience stimulus events 
in spatial and temporal contiguity, re- 
sponse performance as such is not made 
a necessary component of the extinction 
process. Thus placing S directly into 
the goal location, and permitting him 
to observe that the reward object is no 
longer present, should weaken his previ- 
ously acquired cognition concerning the 
character of the goal location. As a 
consequence, he will show a decreased 
tendency to respond in accordance with 
this cognition. 

Hull’s inhibition theory of extinction 
(21), on the other hand, encounters a 
great deal of difficulty in providing an 
explanation of the effects of latent ex- 
tinction (especially when latent extinc- 
tion is employed within the framework 
of a Type I situation), since the performance
of the response to be eliminated
is considered a necessary component
of the extinction process. Reactive
inhibition ($I_R$) and conditioned
inhibition (${}_SI_R$), which are assumed to
produce extinction by jointly opposing
reaction potential (${}_SE_R$), develop only
in the course of response performance.
This implies that either the response to 
be eliminated or some other response 
from which extinction effects can gen- 
eralize must be performed before extinc- 
tion can take place. Since the latent 
extinction procedure does not require 
that an animal perform the response to 
be eliminated, it could not produce an 
increase in the inhibitory factors that 
oppose reaction potential, and consequently
could have no effect upon response
strength. But while it is clear
that Hull’s response-induced inhibition
theory of extinction requires the performance
of the response to be eliminated,
and is thus not adequate to account
for the effects of latent extinction,
Hull’s general behavior theory makes no
such requirement. In a series of early
papers (15, 16, 17, 18, 19, 20), Hull ex- 
plored the possibility of applying a num- 
ber of explanatory constructs to com- 
plex behavioral relationships in which 
reasoning, insight, knowledge, and pur- 
pose appeared to be operative. One of 
these constructs—the fractional antici- 
patory goal response—was used subse- 
quently by others to mediate a wide 
range of behavior phenomena. For ex- 
ample, Kendler’s analysis of switching 
behavior (25, 26, 27) and Osgood’s me- 
diation hypothesis (35) both rely heavily 
on the explanatory capacity of the frac- 
tional anticipatory response mechanism. 
Following the leads contained in Hull’s 
early theoretical article, Spence (40, 41, 
43) has provided a somewhat more rig- 
orous analysis of the role played by this 
mechanism in the acquisition of a T- 
maze choice response, and has used this 
analysis to integrate latent-learning data 
and data concerned with the acquisition 
of opposed spatial responses under the
simultaneous presence of hunger and 
thirst. It is the opinion of the present 
author that the fractional anticipatory 
goal response possesses the deductive 
potential necessary to mediate also an 
interpretation of latent extinction data 
that is consistent with an S-R reinforce- 
ment approach. 

Spence has suggested (40) that, as
a function of differential reinforcement
during training, the stimulus traces of
the cues in the positive goal box and
those in the alley leading to it become
(classically) conditioned to the goal
response (e.g., eating or drinking).
Through generalization, cues at the entrance
to the correct alley acquire the
capacity to evoke that fractional component
of the goal response ($r_g$) not
in conflict with the overt acts of the
behavior sequence. This component
produces a characteristic proprioceptive
stimulus ($s_g$) which in turn becomes
conditioned to entering and continuing
locomotion in the alley leading to the
positive goal box. Through the acquisition
of this associative connection, $s_g$
becomes a component of the habit
strength determining the instrumental
response. It has also been suggested
that “through the intensity of its trace
which . . . increases with the increased
strength of $r_g$ as the latter becomes more
strongly conditioned, $s_g$ determines the
non-associative factor K” (incentive motivation)
(40, p. 273). Spence implies
that the magnitude of this nonassociative
component is an increasing monotonic
function of the intensity of $s_g$.
Since the latter, in turn, is assumed to
covary positively with the strength of
$r_g$, such variables as the amount of
reinforcement and the delay of reinforcement
(which presumably affect the
strength of $r_g$) play a role in determining
the value of K. If it is recalled that


$${}_SE_R = f(M \times {}_SH_R),$$

where

$$M = f(K + V + D),$$

it can be seen that $r_g$ and its characteristic
proprioceptive cue determine the
excitatory strength of the correct response
by contributing a component to
${}_SH_R$ and also by determining the value
of K.³
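
A small numerical sketch may help fix the difference between the
additive combination of K, V, and D used here and a multiplicative
combination. The function f is not specified in the text, so it is
taken to be the identity purely for illustration; the function names
and the numerical values are hypothetical and are not estimates from
any experiment.

```python
def additive_potential(k, v, d, habit_strength):
    """E = (K + V + D) * H, taking f to be the identity (an assumption)."""
    return (k + v + d) * habit_strength


def multiplicative_potential(k, v, d, habit_strength):
    """E = (K * V * D) * H, a multiplicative alternative shown for contrast."""
    return (k * v * d) * habit_strength


# Arbitrary illustrative values for V, D, and habit strength.
v, d, h = 0.5, 1.0, 0.8

for k in (1.0, 0.5, 0.0):
    e_add = additive_potential(k, v, d, h)
    e_mul = multiplicative_potential(k, v, d, h)
    print(f"K = {k:.1f}: additive E = {e_add:.2f}, multiplicative E = {e_mul:.2f}")
```

With the additive form, driving K to zero lowers the excitatory
potential but leaves the contribution of V and D intact; with the
multiplicative form the potential vanishes altogether.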

But while the fractional anticipatory 
goal response has been considered by 
some students of learning to be a con- 
struct that possesses a great deal of ex- 
planatory fertility, it has been consid- 
ered by others as an ad hoc device de- 
signed to rescue the S-R theorist from 
embarrassing experimental data. Koch, 
for example, maintains that: 


Among the potentially most fruitful items in 
Hull’s bequest to the future are the many in- 
genious “peripheral” mechanisms which were 
elaborated in the first instance, in his earlier 
theoretical articles. . . . It is likely that con- 
cepts of this order point to factors which 
must be ultimately taken into account by any 
theory of behavior . . . (28, pp. 164-165). 


On the other hand, Meehl and MacCorquodale
write that “. . . $r_g$ at present
. . . is as readily available a deus
ex machina for nonexpectancy theorists
as the concepts of ‘attention,’ ‘emphasis,’
or ‘perceptual threshold’ are for
Tolman” (31, p. 232). Since $r_g$ appears
to enjoy a somewhat ambiguous
epistemological position, it would be
well to clarify its meaning before attempting
to provide an interpretation
of the effects of latent extinction.

The use of a theoretical construct de- 
noting a nonobservable response process 
carries with it the responsibility of pro- 


3 The additive relationship between V, K,
and D presented here is consistent with 
Spence’s latest formulation (42). Following 
Hull, Spence had earlier postulated a multi- 
plicative relationship between these factors, 
the implication being, of course, that if any 
one of them is reduced to zero the excitatory 
strength of the instrumental response would 
also be reduced to zero. 
viding a set of symptom relations by
which the construct in question can be
coordinated with observable events. A
number of such reductions to data language
is essential if the construct is to
acquire univocal intersubjective reference,
and if it is to be integrated with
other constituents of the theoretical system
in which it functions. One way in
which a set of reduction symptoms can
be provided for $r_g$ is to conceive its relation
to certain antecedent manipulable
variables to be the same as that of any
other molar response with which contemporary
behavior theory has been
concerned (e.g., bar pressing, spatial responses,
salivation, etc.). This conceptualization
implies that such variables
as, for example, number of reinforced
trials, quality and magnitude of the reward
object, and hours of deprivation
will exert the same effect on the strength
of $r_g$ as they have been found to exert
on the strength of these molar responses.
Once the relations postulated to hold
between $r_g$ and certain experimental parameters
are clearly set forth it becomes
possible to specify the effect upon $r_g$ of
variations in the value of any one of
these parameters. However, it should
be emphasized that, since $r_g$ is an inferred
response and as such does not
possess a directly observable element,
evidence relevant to variations in the
strength of $r_g$ can be obtained only by
examining the overt behavior of the organism
under certain sets of experimental
conditions.


III 


The Type I situation. Consider the 
situation in which an animal is required 
to turn right in a T maze to obtain a 
reward object relevant to an existent 
need state. When the excitatory tend- 
ency of the cues in the right alley to 
evoke a right turn becomes greater than 
that of the cues in the left alley to evoke 
a left turn, the animal will respond 
above chance expectancy in the direction
of the goal location. An important factor
involved in the growth of the correct
excitatory potential (${}_SE_R$) is the
increase in habit strength (${}_SH_R$) resulting
from each reinforcement. However,
as the animal is repeatedly reinforced
during the course of training, not only
does ${}_SE_R$ approach its asymptote but,
in addition, the overt consummatory response
appropriate to the reward object
(i.e., eating or drinking) becomes more
strongly conditioned to the stimulus
traces of the cues in the goal location.
On the basis of the latter it appears
reasonable to assume that when latent
extinction is administered following the
termination of training, that component
of the consummatory response which
can occur in the absence of the reward
object (i.e., $r_g$) will be evoked by the
cues to which its molar counterpart was
previously conditioned. Since these cues
are constantly present during latent extinction,
it would be expected that $r_g$
will be strongly and repeatedly evoked,
each latent extinction “trial” providing
the occasions for its free repetition.⁴
Now if $r_g$ is conceived as a response
process that obeys the same behavioral
laws as most molar systems, then it
would follow that these successive nonreinforced
emissions of $r_g$ will produce a
sharp reduction in its response strength
with respect to cues in the goal location.
If these cues are similar to those at the
choice point, and if delay of reward
during training was at a minimum, the
extinction effects of $r_g$ will generalize
readily to antedating segments of the
behavior sequence, so that at the start
of the test trials $r_g$ either will not be
elicited at the choice point or will be

4 Failure to conceive of latent extinction as
providing the occasions for the free repetition
of $r_g$ led Gleitman, Nachmias, and Neisser
(14) to dismiss the possibility of employing
$r_g$ to interpret the results of those experiments
in which a Type I procedure was used.
elicited with greatly reduced excitatory
potential. If latent extinction is effective
in reducing the strength of $r_g$ to
the extent that it is subsequently not
available at the choice point, then $s_g$,
of course, will also not be available, and
the associative connection established
during training between $s_g$ and the instrumental
response will no longer contribute
to the habit strength of that
response. As a consequence of the
withdrawal of this component of habit
strength, the organism will manifest a
decreased tendency to enter the correct
alley during the test trials.

But assume that $r_g$ was not completely
extinguished during the latent
extinction period, and that at the start
of the test trials the cues at the choice
point still possessed some excitatory
tendency with respect to $r_g$. Even if
latent extinction only reduced this tendency,
a decrease in the strength of the
established instrumental response would
nevertheless be expected. It will be recalled
that the incentive motivational
factor K was postulated to be a nonassociative
component of ${}_SE_R$. Since the
relationship between K and $s_g$ is assumed
to be monotonically increasing
and, furthermore, since the intensity of
$s_g$ is a direct function of the $r_g$ evoked,
it would follow that any decrease in the
strength of $r_g$ will reduce the value of
K. This reduction of K will, in turn, reduce
the excitatory potential mediating
the correct approach response, as a result
of which the animal will show a decreased
tendency to enter and continue
locomotion in the alley leading to the
previously positive goal box.
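
The chain of inference in the Type I case (nonreinforced evocations
weaken the fractional goal response, which lowers K, which lowers
excitatory potential) can be traced numerically. The sketch below is an
illustration under strong assumptions that the text does not make:
geometric decay of the fractional response with each exposure, K
linear in its strength, and f taken as the identity. It also omits the
separate contribution of the proprioceptive cue to habit strength. All
names and parameter values are hypothetical.

```python
def fractional_response_strength(initial, decay, exposures):
    """Strength of the fractional goal response after n nonreinforced
    evocations, assuming geometric decay (an illustrative assumption)."""
    return initial * (1.0 - decay) ** exposures


def excitatory_potential(response_strength, v=0.5, d=1.0, habit=0.8, k_gain=1.0):
    """E = (K + V + D) * H with K = k_gain * response_strength.
    The identity f and the linear form of K are assumptions."""
    k = k_gain * response_strength
    return (k + v + d) * habit


# Excitatory potential after 0 to 4 latent extinction exposures.
potentials = [
    excitatory_potential(fractional_response_strength(1.0, 0.4, n))
    for n in range(5)
]

for n, e in enumerate(potentials):
    print(f"latent extinction exposures = {n}: E = {e:.3f}")
```

Under these assumptions the potential falls with every exposure but
stays above the level set by V and D alone, which corresponds to the
case in which the fractional response is weakened but not eliminated.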

The Type II situation. The experi- 
mental procedures that we have labeled 
Type I and Type II are similar in that 
they both involve introducing an ani- 
mal into an empty goal location after it 
has acquired a response instrumental in 
securing a reward object previously con- 
tained in that location. The difference 
between these procedures arises with respect
to the responses with which they
deal following the latent extinction pe- 
riod. It will be recalled that while the 
Type I procedure is concerned with the 
effect of latent extinction on the strength 
of an established response, the Type II 
procedure is concerned with the capacity 
of goal-box stimuli that have acquired 
secondary reward value to mediate the 
learning of a new response after latent
extinction has been administered. It is 
to the latter procedure that we now turn 
our attention. 

Consider the experimental situation in 
which an animal is trained to traverse a 
straight alley in order to obtain a re- 
ward object relevant to an existent need 
state. A differential reinforcement tech- 
nique is employed by which, on a given 
number of trials, the animal runs to a 
white goal box that contains the reward 
object, while on the remaining trials he 
runs to an empty black goal box. By 
the principle of secondary reinforcement 
the cues in the white goal box will ac- 
quire reward value as a function of their 
association with the reinforcement situation.
Hull (21, 22) suggested the fractional
anticipatory goal response and
its proprioceptive cue as providing the
mechanism underlying the acquisition of
secondary reward properties. The role
that $r_g$ plays in this respect develops by
virtue of its intimate association with
the reinforcement situation, this association
establishing $s_g$ as a secondary
reinforcer. In order for any neutral
stimulus to acquire reward value it
must acquire the capacity to elicit $r_g$
either by being conditioned to $r_g$ directly,
as in the case of stimuli that are
spatially and temporally contiguous with
the receipt of the reward object, or
through generalization, as in the case of
stimulus events that are similar to those
actually conditioned. Hull further suggested
that a stimulus that has acquired
reward value will retain that value as
long as it retains the power to evoke $r_g$.
It would appear reasonable on this basis
to assume the acquired reward capacity
of a stimulus to be some positive function
of the strength of the $r_g$ to which
it is conditioned, and for any subsequent
reduction in this strength to decrease
the reinforcing value of the stimulus.
Now with respect to the experimental
situation being considered, assume that
following training the positive and negative
goal boxes are placed on opposite
arms of a T maze, and that the animal
is given a number of free trials in the
absence of the reward object. When he
first responds correctly and enters the
positive goal box, $r_g$ will be evoked by
those stimuli to which it was conditioned
during the course of training.
As a result of the evocation of $r_g$, and
the presence consequently of $s_g$ with its
reinforcing capacity, there will occur an
increment in the excitatory potential of
the correct response. As long as $r_g$ continues
to be evoked in the positive goal
box, the correct response will be reinforced
and will increase in strength relative
to the incorrect response. But the
excitatory potential of the correct response
will not continue to increase unabated
until some asymptotic value is
reached. Since the reward object is not
present during the maze series, the emission
of $r_g$ in the positive goal box following
each correct choice will reduce its
response strength, as a result of which
the intensity of $s_g$ and hence the magnitude
of available secondary reinforcement
will decrease progressively. When
the stimuli in the positive goal box no
longer elicit a threshold value of $r_g$, the
excitatory potential of the correct response
will begin to decrease, so that
after several trials the animal will cease
to respond correctly in excess of chance.
Now assume that latent extinction is administered at some time between
the termination of straight-alley training and the beginning of the maze
trials. For reasons presented in connection with the Type I situation,
placing an animal in a previously baited goal box provides the occasion
for the free nonreinforced repetitions of r_g, each such repetition
reducing its response strength. Because of the role that s_g plays as a
reinforcing agent, it would be expected that, as the strength of r_g
decreases during the course of latent extinction, the intensity of s_g,
and hence the secondary reward value of goal-box stimuli, will also
decrease. Depending on the duration of latent extinction and the
presence of other conditions to be discussed below, r_g either will not
be elicited in the positive goal box at the termination of the latent
extinction period or it will be elicited with greatly reduced excitatory
potential. In the event that r_g is completely extinguished with respect
to goal-box stimuli, there will be no secondary reinforcement available
to mediate subsequently the learning of a choice response in the maze.
On the other hand, if the excitatory strength of r_g has not been
eliminated but has been reduced to a near-threshold value, then r_g will
become completely extinguished early in the maze series. In any case,
animals that have been given latent extinction would not be expected to
show a strong tendency to choose the arm of the maze leading to the
positive goal box, as compared with control animals that had not
undergone the latent extinction experience.
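
The Type II argument above can be illustrated with a small numerical
sketch. Nothing in it comes from the paper's data: the proportional
weakening of the anticipatory response (written r_g here) per
nonreinforced evocation, the threshold value, and the rule that
secondary reinforcement is proportional to the supra-threshold strength
of r_g are all assumptions chosen only to make the verbal derivation
concrete. The function names are hypothetical.

```python
def latent_extinction(r_g, n_evocations, decay=0.25):
    """Weaken r_g by a fixed proportion per nonreinforced evocation (assumed rule)."""
    return r_g * (1.0 - decay) ** n_evocations


def secondary_reinforcement_in_maze(r_g, n_entries=15, decay=0.25, threshold=0.1):
    """Total secondary reinforcement over nonrewarded entries into the positive goal box."""
    total = 0.0
    for _ in range(n_entries):
        if r_g <= threshold:
            break  # r_g is no longer evoked, so s_g and its reinforcement are absent
        total += r_g - threshold  # assumed: reinforcement grows with supra-threshold r_g
        r_g *= 1.0 - decay        # the evocation is nonreinforced, so r_g weakens
    return total


# Control animals enter the maze series with r_g at full strength.
control = secondary_reinforcement_in_maze(1.0)
# Brief latent extinction leaves r_g near threshold; it extinguishes early in the series.
partial = secondary_reinforcement_in_maze(latent_extinction(1.0, 6))
# Prolonged latent extinction drives r_g below threshold before the maze series begins.
complete = secondary_reinforcement_in_maze(latent_extinction(1.0, 10))
print(control, partial, complete)
```

Under these assumptions the three cases order themselves as the text
describes: the control group has the most secondary reinforcement
available, the partially extinguished group has a little, and the
completely extinguished group has none.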

By employing the fractional anticipatory response mechanism in a manner
consistent with the requirements of the neobehavioristic system in which
it was developed, we have been able to provide an interpretation of the
effects of latent extinction in both the Type I and Type II situations.
Within the theoretical framework employed it was necessary only to
assume that r_g is repeatedly evoked during latent extinction, and that,
as a consequence of this repeated evocation, r_g either becomes
completely extinguished with respect to cues in the goal location or
undergoes a sharp reduction in its response strength.


IV 


The value of a theoretical construct 
should be measured not only in terms 
of how well propositions involving that 
construct explain available data but also 
in terms of the extent to which these 
propositions generate novel (i.e., not 
previously formulated) functional rela- 
tionships. It must be determined, there- 
fore, whether the fractional anticipatory 
response mechanism as employed in the 
present analysis is capable of mediating 
predictions concerning the influence on 
behavior of certain selected variations in both the Type I and Type II
situations. Unless the present analysis possesses a prospective
reference, propositions involving r_g have no value other than as
vehicles in providing an economical description of already established
empirical relationships. While
such descriptions play an important role 
in science, the constructs involved func- 
tion simply as analytic devices whose 
systematic significance is limited to fur- 
nishing summary statements of observed 
facts (5). 

On the basis of our analysis of the effect of latent extinction on the
strength of r_g we can specify the following general hypothesis: Latent
extinction will reduce the excitatory strength of an instrumental
response to the extent that it reduces the excitatory strength of r_g.
Following from this is the implication that if a variable is introduced,
during latent extinction, that increases the degree to which r_g is
weakened, a greater decrement in subsequent molar performance will occur
than if latent extinction is administered in the absence of this
variable. It will be recalled that r_g was conceived as a hypothetical
response process whose relation to observable antecedent events was
assumed to follow the same behavioral laws as molar responses like bar
pressing and salivation. This implies that all those experimental
operations that have been demonstrated to reduce the strength of a molar
response should affect similarly the strength of r_g. Since empirical
evidence appears to indicate that reduction in the strength of these
molar responses occurs as a monotonic increasing function of the number
of nonreinforced evocations, it would follow that the more frequently
r_g is evoked during latent extinction the greater will be the reduction
in its response strength. On this basis we would be led to expect latent
extinction to produce a pronounced decrement in subsequent molar
performance under those experimental conditions that produce a rapid
rate of emission of r_g during the latent extinction period. We turn now
to consider some of these conditions.
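
The step from "rate of emission" to "reduction in strength" can be made
explicit with a short sketch. The text asserts only that the reduction
is a monotonic increasing function of the number of nonreinforced
evocations; the exponential form, the constant k, and the function name
below are assumptions adopted for illustration, not anything the paper
specifies.

```python
import math


def residual_strength(initial, evocations_per_min, minutes, k=0.05):
    """Strength of the anticipatory response left after a latent extinction period.

    The number of nonreinforced evocations is rate * duration; the
    exponential loss per evocation is an assumed, purely illustrative form.
    """
    n_evocations = evocations_per_min * minutes
    return initial * math.exp(-k * n_evocations)


# Same period of goal-box exposure, different rates of emission.
slow = residual_strength(1.0, evocations_per_min=2, minutes=5)
fast = residual_strength(1.0, evocations_per_min=6, minutes=5)
print(slow, fast)
```

Any other monotonic decreasing function of the number of evocations
would yield the same ordinal prediction: for a fixed period of latent
extinction, the condition with the faster rate of emission leaves the
weaker anticipatory response.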

Variations in the strength of the relevant drive. The results of several
studies have indicated that the higher the level of a relevant
(rewarded) drive during a series of extinction test trials the greater
the rate of response evocation. For example, Jenkins and Daugherty (24)
have shown that following training an increase in drive strength will
result in the more frequent emission of a nonreinforced pecking
response, while Skinner (39) has presented evidence confirming this
relationship with respect to bar pressing. On the assumption that r_g
obeys the same behavioral laws as these responses, we would expect a
higher drive level during latent extinction to result in a greater
increase in the number of nonreinforced evocations of r_g.⁵ On this
basis the following hypothesis can be specified with reference to the
Type I situation: If after being trained to a position habit in a T maze
under identical motivational conditions, one group of animals is given
latent extinction under a high drive and another group under a low
drive, the high-drive group will show a greater decrease in the tendency
to choose the previously baited goal box when both groups are again run
under the motivational level that prevailed during acquisition. A recent
experiment by Pliskoff (36) is relevant to the present hypothesis.
Animals under 8 hr. of food deprivation were trained in the performance
of a T-maze turning response, after which latent extinction was
administered under either 0, 8, or 22 hr. of hunger. When the extinction
test trials were subsequently run under acquisition conditions it was
found, in accord with expectation, that on the first test trial the
22-hr. group chose the previously correct side of the maze less often
than the 0-hr. group. This result suggests a relationship between drive
level during latent extinction and reduction in the strength of a
previously established response.

5 The relation between drive strength and response evocation also
follows readily from Hull’s 1943 (21) and 1952 (22) postulate sets.

The assumption concerning the relation between drive strength and rate
of emission of r_g is also relevant to the Type II situation. It will be
recalled that the acquired reward capacity of a neutral stimulus was
assumed to be some positive function of the excitatory strength of the
r_g to which that stimulus was conditioned. This implies that if
increasing the drive level during latent extinction serves to increase
the number of nonreinforced emissions of r_g (and thereby to decrease
the strength of the latter with respect to goal location stimuli), then
there should be a more pronounced reduction in secondary reward value
the more intense the drive level under which latent extinction is
administered. Evidence supporting such a hypothesis was reported
recently by Moltz and Maddi (33). They employed a differential
reinforcement technique during training which consisted of running
animals in a straight alley to a goal box of one color on rewarded
trials, and to a goal box of a different color on nonrewarded trials.
While all Ss were under the same hunger drive schedule during training
(22 hr.), they were under either a 0-, 22-, or 44-hr. drive at the
beginning of the latent extinction period. The test trials were
subsequently run in a T maze, with all Ss again on the same hunger
deprivation schedule imposed during training. It was found that the mean
number of responses (in 15 free choices) to the side of the maze
containing the positive goal box was 9.30, 7.10, and 6.30 for the 0-,
22-, and 44-hr. groups, respectively.⁶ These results are in accord with
theoretical expectation, since they indicate a relationship between
motivational level during latent extinction and the capacity of goal
location stimuli to mediate subsequently the learning of a new response.

6 The F ratio for the difference between these drive levels was
significant well below the .01 level. No significant difference in maze
performance was found between control groups which were not given latent
extinction but which were made differentially hungry during the latent
extinction period.

Variations in the strength of an irrelevant drive. Hull has proposed
that the drive strength available at any given moment is a function not
only of the need for which the goal object in the situation is
appropriate but also of all irrelevant (nonrewarded) drives operative at
the time. It is the total need state, consisting of both relevant and
irrelevant drives, that is assumed to combine with sHr (in addition to
the nonassociative factors V and K) to determine excitatory potential.
In Principles of Behavior (21) Hull stated as a corollary that, with the
number of reinforcements held constant, response strength will increase
as some positive function of the magnitude of an irrelevant drive.

Webb (46) reported the results of a study designed to determine the role
of an irrelevant drive (thirst) in activating a habit structure when the
drive under which the habit was originally developed (hunger) was
satiated. He found the strength of a panel-pushing response to be
approximately an increasing linear function of the intensity of the
irrelevant drive. Brandauer (6) subsequently confirmed this
relationship, using a different operant response. Since we have assumed
that r_g follows essentially the same behavioral laws as most molar
responses, we appear to be in the somewhat uncomfortable position of
having to propose that the r_g appropriate to a relevant hunger drive
will increase in strength in approximately linear fashion with increases
in the magnitude of an irrelevant thirst drive. However, the
implications of a theory or of a postulate are determined not only by
the relations or laws specified but also by the conditions operative in
the experimental situation to which these relations or laws are applied.
As a case in point, consider the situation into which there is
introduced an irrelevant need that provides a drive stimulus which
elicits responses incompatible with a previously established criterion
response. Under such conditions it would not appear reasonable to expect
the strength of the criterion response to increase as the irrelevant
need becomes more intense. Indeed, only when conflicting habit
tendencies are not activated would a criterion response, whether it be
“molar” or “molecular,” be expected to increase with increases in the
intensity of an irrelevant drive stimulus.

The results of several studies (1, 2, 7, 13) in the area of secondary
motivation provide evidence in support of this contention. For example,
Amsel and Maltzman (3) have shown that, when anxiety is added to an
existing motivational complex, the strength of a previously established
consummatory (drinking) response will increase if the experimental
situation is arranged so that responses conditioned to the irrelevant
anxiety-drive stimulus (e.g., crouching, face washing, escape responses,
etc.) cannot compete with the drinking response. On the other hand, if
the situation is arranged so that response competition is permitted, the
rate of drinking will be sharply reduced. These results, in addition to
those provided by Kendler (26) concerning the “incompatibility” of
fractional anticipatory eating and drinking responses, make it appear
reasonable to assume that anticipatory drinking responses (r_gd), when
instigated by an irrelevant thirst drive, will compete with anticipatory
eating responses (r_ge), and consequently will decrease the rate of
emission of the latter.⁷ Following from this is the implication that
after training under moderately intense hunger, the stronger the thirst
drive that is subsequently introduced (thereby increasing the strength
of r_gd) the greater the reduction in the rate of evocation of r_ge.

This analysis generates an important deduction with respect to both the
Type I and Type II situations. Assume that an animal learns to approach
a distinctive goal location under moderately intense hunger, and that as
a function of this training the cues in the goal location acquire a
strong excitatory tendency with respect to r_ge. If latent extinction is
administered subsequently under both hunger and thirst, the presence of
incompatible anticipatory drinking responses should reduce the number of
nonreinforced evocations of r_ge, this reduction being greater the more
intense the thirst drive. On this basis, and on the basis of the
previously formulated assumption concerning the relation between rate of
emission of r_g and the influence of latent extinction on molar
performance, we can state the following general hypothesis: If an
organism is trained to acquire a response under moderately strong hunger
but with ad libitum consumption of water, the effectiveness of latent
extinction in reducing the strength of this response will decrease as
the intensity of thirst during latent extinction increases. More
specifically, in the Type I situation we would expect that if, after
learning a position habit under 22 hr. of hunger, one group of animals
is given latent extinction under the same hunger drive but in
conjunction with 3 hr. of thirst, and another group in conjunction with
22 hr. of thirst, the latter will yield significantly more responses to
extinction when the motivational complex of both groups is subsequently
made identical with that which prevailed during acquisition.

7 On the assumption that anticipatory drinking responses are in some
manner “incompatible” with anticipatory eating responses, Kendler has
attempted to explain what he has called the switching phenomenon, i.e.,
“... the tendency of rats after learning to go to one end of a T maze to
obtain dry food to switch their spatial responses when made thirsty”
(26, p. 179). He tested several deductions based on this assumption, one
of which was that the conflict between r_ge and r_gd would be
intensified (and hence the amount of switching increased) when S is
shifted to a strong as compared with a weak thirst drive. The results
obtained were consistent with this deduction.

Howard Moltz

Since the relation between intensity of an irrelevant thirst drive and
the rate of emission of r_ge would be expected to obtain in a Type II
situation as well, we may also state the following hypothesis: After an
approach response to food is established under moderately intense
hunger, a greater reduction in the acquired reward value of goal
location stimuli will occur if, in conjunction with the prevailing
hunger drive, latent extinction is administered under 3 hr. of thirst as
compared with 22 hr. of thirst. Unfortunately there is no empirical
evidence relevant either to this hypothesis or to the hypothesis
formulated with reference to the Type I situation. Several experiments
are, however, being conducted in the writer’s laboratory in the attempt
to determine the role of drive interaction during latent extinction.⁸

8 These studies are being carried out in collaboration with Miss Nina
Tokarow and Mr. Leonard Rosenblum.

Reward immediately prior to latent extinction. In the presence of the
appropriate drive state, a reward object with which an animal has had
previous commerce will serve typically as an “unconditioned” stimulus
for the consummatory response, in the sense that the latter will be
regularly elicited when the reward object is presented. Since r_g has
been conceived as a hypothetical process representing a fractional
component of the consummatory response, it appears reasonable to assume
that r_g possesses this functional characteristic in common with its
molar counterpart. On this basis we would expect that, with the primary
drive state remaining relatively constant, the frequency with which r_g
is evoked will increase during a period following consumption of an
appropriate goal object. However, if a large amount of reward is
consumed, a reduction in the primary drive state would occur which, in
turn, might offset or prevent entirely an increase in the rate of
emission of r_g. On the other hand, a small amount of reward, while not
appreciably modifying the drive state, might not be sufficient to effect
this increase. Presumably there is an optimal amount of reward that
would produce the desired result. With respect to food as the reward
object, experimental evidence reported by Maltzman (30) indicates this
amount to be within the range of 1000 to 2000 mg. when the animal is
under a hunger drive of approximately 22 hr.⁹ On the basis of
Maltzman’s results, we would be led to expect that if, after training
under hunger motivation, an animal is fed about 1500 mg. of food 20 sec.
prior to latent extinction, an increase in the rate of emission of r_g
will occur during the latent extinction period. This increase in rate of
emission will, according to the present analysis, produce a marked
reduction in the strength of r_g, as a consequence of which latent
extinction should be more effective in reducing subsequent molar
performance. Thus in the Type I situation it would be expected that if
latent extinction is preceded by the ingestion of a reward object
appropriate to the prevailing drive state, a greater decrement in the
strength of the established instrumental response will occur, while in
the case of the Type II situation a greater decrement in the acquired
reward value of goal location stimuli would be expected.

9 Maltzman found that hungry rats fed about 1200 mg. of food
approximately 20 sec. before being run in a straight alley tended to run
significantly faster than a control group that was not pre-fed. Several
other investigators (4, 8, 34) have also reported that small amounts of
preliminary reward will facilitate the performance of an instrumental
response. Maltzman has indicated how these results can be interpreted in
terms of the fractional anticipatory response mechanism.

The hypotheses that have been presented in the present paper have been
concerned with the relation between certain experimental variables
introduced during latent extinction and the rate of emission of r_g.
Further experimental work and theoretical analysis might also be
directed toward determining whether variations in molar performance
following latent extinction can be ascribed to other parameters of r_g
function. In addition, the effect of latent extinction might also be
investigated with regard to the influence of such training parameters as
the quality and magnitude of the reward object, number of reinforced
trials, and the nature of contextual stimulus conditions.


V 


The present paper represents an at- 
tempt to provide an interpretation of 
latent extinction that is consistent with 
an S-R reinforcement approach. The ef- 
fects of latent extinction on the strength 
of an established instrumental response 
and on the secondary reward value of 
goal location stimuli were analyzed. It 
was found that propositions involving the fractional anticipatory goal
response in conjunction with other propositions of neobehavioristic
theory are adequate to account for these effects. Several hypotheses
were suggested concerning the influence of certain variables on the rate
of emission of the anticipatory goal response during the latent
extinction period. Evidence relevant to these hypotheses was considered.


The purpose of this paper, as its title
implies, is to suggest a revision of the 
neural construct that Hebb (5) has 
called a “cell assembly.” The cell as- 
sembly plays a fundamental role in 
Hebb’s behavior system, especially in 
his admirable treatment of perceptual 
learning, but the proposed changes are 
not intended to affect these psychologi- 
cal implications of the construct; they 
are aimed, rather, at making the under- 
lying neural processes more explicit. As 
Hebb has mentioned (8), he tried to 
limit himself to using only known, or 
highly probable, properties of neurons 
in setting up his model; but it is clear 
that these soon proved to be inadequate, 
and he then had to introduce ad hoc mechanisms to explain how such
artificially simplified elements could combine to produce the observed
molar behavior. We can now see that from
many standpoints the theory would have 
been better if the additional postulates 
had been introduced at the neural level, 
where they could have been more easily 
evaluated and tested; but on the other 
hand, such a theory might never have 
been given a hearing. 

There is little doubt that flaws exist in the arguments Hebb uses to
derive the molar properties of his construct. As will be shown below, if
we try to operate with only those properties and connections of
individual neurons that Hebb considered respectable, the behavior of the
resulting network is incompatible with that required for the cell
assembly. Some of the inconsistencies can be eliminated simply by taking
into account neurophysiological discoveries made during the last ten
years, but in order to derive a system having anything like the
characteristics required for the cell assembly it will be necessary also
to invoke a number of purely hypothetical neural processes.

1 The writer wishes to acknowledge the influence on this paper of many
helpful arguments with his colleagues at McGill, especially those with
Drs. James Olds and Seth Sharpless, and with Professor D. O. Hebb. The
paper was written while the author was receiving support from the
Foundations’ Fund for Research in Psychiatry, and from the Ford
Foundation.

As far as can be ascertained from 
The Organization of Behavior (5), only 
four factors are considered to determine 
whether a cortical neuron will fire: (a) the number of impulses
bombarding the neuron from all sources during the few milliseconds in
which temporal summation is assumed to take place; (b) the “strength” of
the synapses concerned
(the strength of a synapse may increase 
beyond its native value in accordance 
with the provisions of a learning postu- 
late); (c) whether the neuron is refrac- 
tory; and (d) by implication at least, 
the effect of neural fatigue. In the net- 
work that Hebb postulated as a model 
of the cortex, neurons with the above- 
mentioned properties are randomly in- 
terconnected, each neuron having effer- 
ent and afferent connections with many 
others. Some of the neurons can be 
fired by sensory input. 

Such a network is somewhat analo- 
gous to the nuclear fission reactor. In 
that machine a neutron may either in- 
duce a fission, in which case it releases 
several new neutrons, or it may be ab- 
sorbed or lost without releasing any 
other neutrons. If the neutron flux in 
the machine is to remain constant, all 
the neutrons released at a fission must 
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be lost except one. For stable opera- 
tion, in other words, the multiplication 
factor (the average number of fissions 
produced by the neutrons released at a 
previous fission) must be one. Like- 
wise, in the cortical network, a fired 
neuron may fire several others, or it may 
send impulses to refractory, fatigued, 
or inadequately facilitated neurons and 
produce no further firing. If the aver- 
age neuron fires one other neuron, the 
total activity in the network will re- 
main constant. 

The analogy is not quite complete, 
however; in the fission reactor the multi- 
plication factor is slightly reduced by an 
increase in the neutron flux, but in the 
cortical network just the opposite is 
true, the multiplication factor increas- 
ing with total neural activity. This re- 
sults from the effect of summation at 
synapses. When a neuron is firing by itself, the probability of its
firing another is small; but when many neurons are firing at the same
time, they provide a facilitatory background for each other, and one of
them may then be able to fire several neurons to which it sends
efferents. Thus, the number of
neurons fired by an average neuron will 
increase in proportion to the total num- 
ber of neurons firing, a state of affairs 
which precludes stable operation. In 
the nuclear reactor, if the multiplica- 
tion factor should fluctuate to a value 
of 1.001, a few more neutrons will be 
released than are lost, and the total flux 
will increase; at this higher flux the 
multiplication factor will be reduced 
again to one. If the same fluctuation 
should occur in the cortical network, a 
few more impulses will circulate, the 
general level of facilitation will rise, and 
the multiplication factor will increase 
still further. This increase will only be 
halted when all the neurons in the sys- 
tem are firing as fast as their refractory 
periods will allow. Unlike the nuclear pile, therefore, the cortical
network has only two levels of stable operation, “full on” and “full
off.” Neither of these
states fulfills the requirements of Hebb’s 
cell assembly. It is hardly necessary to 
add that the phase sequence, which is 
essentially a chain of associated cell as- 
semblies representing a “train of ideas,” 
bears as tenuous a relationship to the 
basic neural axioms as does the cell as- 
sembly itself. 
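
The stability argument can be checked with a few lines of arithmetic. The
sketch below is not a model of either system: it assumes only that the
multiplication factor departs linearly from one as activity departs from
an equilibrium level, falling with activity in the reactor and rising
with activity in the cortical network. The constants and function names
are hypothetical, and the perturbation used is one percent rather than
the 1.001 of the text, simply so that the iteration settles quickly.

```python
SETPOINT = 100.0  # activity level at which the multiplication factor equals one
CEILING = 1000.0  # "full on": every unit firing as fast as refractoriness allows
SLOPE = 0.1       # assumed sensitivity of the multiplication factor to activity


def reactor_k(flux):
    """Negative feedback: the factor falls as the neutron flux rises."""
    return 1.0 - SLOPE * (flux - SETPOINT) / SETPOINT


def cortex_k(active):
    """Positive feedback: summation raises the factor as more neurons fire."""
    return 1.0 + SLOPE * (active - SETPOINT) / SETPOINT


def run(k_of, start, steps=500):
    """Apply the multiplication factor generation after generation."""
    level = start
    for _ in range(steps):
        level = min(CEILING, max(0.0, level * k_of(level)))
    return level


print(run(reactor_k, 101.0), run(reactor_k, 99.0))  # both return toward SETPOINT
print(run(cortex_k, 101.0), run(cortex_k, 99.0))    # one saturates, one dies away
```

With the sign of the feedback reversed, the same small fluctuation that
the reactor corrects is amplified in the network, which ends either at
the ceiling or at rest, the two states the text calls “full on” and
“full off.”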

The fact that Hebb’s constructs are 
not rigorously derived is not as serious 
for the over-all theory as it might seem, 
because many of the more molar physio- 
logical principles that Hebb introduces 
are valid and important for the under- 
standing of psychological phenomena, 
irrespective of the exact neural mecha- 
nisms involved. Nevertheless, the pos- 
sibility of bridging the gap between the 
physiological and the psychological lev- 
els with a more substantial structure 
presents a fascinating challenge, espe- 
cially in view of the improvements made 
to the supports on the physiological side 
since The Organization of Behavior was 
written. 


A New Physiological Interpretation of 
the Cell Assembly 


In what follows it will be recognized 
that we are adopting Hebb’s descriptions 
of the cell assembly and the phase se- 
quence as far as their psychological 
properties are concerned, but instead of 
trying to reconcile these to a minimum 
of oversimplified neural data, further 
postulates will be introduced in an at- 
tempt to provide an explicit relation- 
ship between the constructs and their 
elements. 

The first additional postulate concerns 
inhibition. At the time when Hebb was 
developing his theory, many physiolo- 
gists were strongly opposed to the idea 
of neural inhibition, largely because it 
was difficult to fit into the electrical 
theory of synaptic transmission. There 
was a feeling that all inhibitory phenomena could be explained by lack of
facilitation, or by invoking the prin- 
ciple of refractoriness, and Hebb seems 
to have accepted this view. Since then, 
the work of Eccles (3) and others has 
provided a better understanding of syn- 
aptic transmission, and inhibitory trans- 
mission is now generally accepted. 

The processes underlying learning have 
still not been revealed by the neuro- 
physiologists, and to fill this lacuna 
there seems to be nothing against re- 
taining the postulate suggested by Hebb: 
“When an axon of cell A is near enough 
to excite a cell B and repeatedly or per- 
sistently takes part in firing it, some 
growth process or metabolic change takes 
place in one or both cells such that A’s 
efficiency, as one of the cells firing B, is 
increased” (5, p. 62). 
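
One common way of writing Hebb's postulate as an update rule is sketched
below. The postulate itself states only that A's efficiency in firing B
increases when A repeatedly takes part in firing B; treating "takes part
in firing" as simple co-activity, and letting the increase level off at
an upper bound, are assumptions made here for illustration. The function
name and parameter values are hypothetical.

```python
def hebbian_update(weight, pre_fired, post_fired, rate=0.1, w_max=1.0):
    """Strengthen the A-to-B synapse when A fires and B fires with it.

    The saturating form (growth slows as weight nears w_max) is an
    assumption; the postulate does not specify how the increase is bounded.
    """
    if pre_fired and post_fired:
        weight += rate * (w_max - weight)
    return weight


# Repeated joint firing of A and B raises the efficiency of the synapse.
w = 0.2
history = [w]
for _ in range(20):
    w = hebbian_update(w, pre_fired=True, post_fired=True)
    history.append(w)
print(history[0], history[-1])
```

Under this rule the synapse is unchanged on any occasion when either cell
fails to fire, which corresponds to the postulate being silent about
those occasions.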

Another assumption that we shall make here is that the cell assembly
involves mainly cortical neurons. It is to be understood, however, that
connections between cortical cells may often take place via cells in the
thalamus, or other subcortical nuclei. In the model to be described, the
fine-structure of the network has to be taken into account; Lorente de
Nó’s beautiful diagrams of the cortex (11) are invaluable for this
purpose. They reveal a complex vertical organization amongst the
different layers, including one conspicuous circuit to which Lorente
himself draws attention. This is the return of collaterals from the long
descending axons of some cells back to the region of the cell body,
where they appear to end on Golgi type II, or similar short-axon cells.
The Golgi cell axons proliferate in baskets around the bodies of the
long-axon cells “. . . so that the discharge of the cell with a short
axon results in powerful, practically simultaneous stimulation of a
large number of other cells” (11, p. 303). It might appear from this
description that the feedback circuit is admirably suited to produce
uncontrolled epileptic discharge in the cortex, but recent
microelectrode work by Eccles, Fatt, and Landgren (4) indicates that in
the spinal cord, where a similar organization of long- and short-axon
cells is to be found, the short-axon cells inhibit the larger ones. It
is likely that this is also true in the cortex.

Fig. 1. Diagram of the postulated connections of cortical neurons.

The following highly simplified or- 
ganization will therefore be assumed for 
the cortex (Fig. 1). Only two types of 
neuron are considered, those with long 
axons (having excitatory connections 
with other parts of the cortex or with 
the motor system), and those with short 
axons (which have local inhibitory con- 
nections). Those of the short-axon type 
receive impulses from the long-axon cells 
in their vicinity via recurrent collaterals, 
which leave the main shaft of the axon 
near the inner layer of the cortex and 
return toward the surface. Apart from 
those in the sensory cortex, which re- 
ceive additional afferents from the sen- 
sory pathways, the long-axon cells re- 
ceive their input from two major sources: 
(a) from other cortical cells, either 
directly via cortico-cortical association 
fibers, or indirectly by way of distribut- 
ing cells in the association nuclei of the 
thalamus; and (b) from the nonspecific
projection system, which will here be re-
garded as a purely excitatory arousal
system. This second source of afferent 
stimulation must play an important role 
in motivation, and it will be discussed 
in that connection later; for the mo- 
ment, it need only be noted that we 
postulate that the facilitatory bombard- 
ment it provides is necessary for the 
conduction of impulses within the cor- 
tico-cortical network. 

It appears from Lorente’s diagrams 
(11, pp. 296, 298) that the long axons 
from the association cells branch and 
travel through the white matter to a 
number of different parts of the cortex. 
Each of these branches divides again as 
it ascends through the layers of the 
cortex to synapse with other long-axon 
cells. With this in mind, we can now 
proceed to analyze the activity in this 
network in the way that we previously 
analyzed that of Hebb’s model. We
shall start by assuming that no cortical
cells are firing, but that there is a suffi-
cient facilitation from the nonspecific
projection system to make transmission 
between cortical cells possible. Suppose 
now that one cortical cell is somehow 
fired, and that it has effective connec- 
tions to several other long-axon cortical 
cells (ten, let us say). These ten long- 
axon cells will in turn fire about a hun- 
dred others, and so on. At this rate 
the activity would soon snowball to an 
astronomical value, so that it clearly 
cannot continue to multiply for very 
long; in fact, the spread will quickly be 
checked by the firing of inhibitory cells. 
As the density of firing cells increases, 
impulses from many sources will arrive 
almost simultaneously in any small re- 
gion of the cortex. When a long-axon cell 
is fired by one of the impulses, it will 
fire the short-axon cells in its vicinity 
and so prevent many subsequently ar- 
riving impulses from firing other neigh- 
boring cells. Thus, many of the im- 
pulses circulating in the cortex will be 
lost because they arrive in the regions
of strong inhibition surrounding re- 
cently fired cells. Finally, a level of 
activity will be reached at which only 
ten per cent of the impulses are able to 
fire cells, so that in the example given 
(in which one cell could fire ten others 
in the absence of inhibitory influences) 
a state of equilibrium will be reached. 
If more long-axon cells start to fire, the 
regions of inhibition will be crowded to- 
gether and less than one tenth of the 
circulating impulses will fire new cells; 
the activity must therefore drop back to 
the equilibrium value. If, on the other 
hand, too few cells are fired, the multi- 
plication factor will increase above unity 
and the total activity will increase again. 
It is true that many factors have been 
neglected in this analysis; some of them 
are important and will be considered in 
a moment; other difficulties (such as 
the fact that when the level of neural 
activity rises, the increase in inhibition 
will be offset to an indeterminate extent 
by increased temporal and spatial sum- 
mation) cannot be dealt with until bet- 
ter quantitative data are at hand. In 
the meantime we can only postulate that 
the parameters involved have values 
within the limits necessary to allow the 
network to behave in the manner de- 
scribed. 
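
The equilibrium argument above can be illustrated numerically. What follows is only a sketch under stated assumptions: the text gives no functional form for the inhibition, so the saturating rule in `success_fraction` and the names `fanout`, `half_inhibition`, and `simulate` are hypothetical choices made for illustration. The fan-out of ten is taken from the example in the text.

```python
def success_fraction(active, half_inhibition=50.0):
    """Fraction of circulating impulses that still fire a cell.

    Assumed form (not given in the text): the fraction falls as more
    long-axon cells fire and their inhibitory surrounds crowd together.
    """
    return 1.0 / (1.0 + active / half_inhibition)


def simulate(active, steps=40, fanout=10.0, half_inhibition=50.0):
    """Return the number of firing cells at each step of the avalanche."""
    history = [active]
    for _ in range(steps):
        # Each firing cell sends impulses to `fanout` others; only the
        # uninhibited fraction of those impulses fires a new cell.
        active = fanout * active * success_fraction(active, half_inhibition)
        history.append(active)
    return history


if __name__ == "__main__":
    # One start far below the equilibrium level, one far above it.
    for start in (1.0, 2000.0):
        final = simulate(start)[-1]
        print(start, round(final, 3), round(success_fraction(final), 3))
```

Whether the sketch starts from a single cell or from far too many, it settles at the level where the multiplication factor, `fanout * success_fraction(active)`, equals unity, that is, where one impulse in ten fires a cell.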

For reasons that will be clear later, it 
is desirable to construct the model in 
such a way that the equilibrium activity 
continues to circulate within a single 
group of cortical cells, instead of pro- 
gressing continuously to new cells at 
each step in the firing. To this end, the 
following postulate will be made regard- 
ing the anatomical organization: the 
short-axon inhibitory cells which receive 
recurrent collaterals from a long-axon 
cell have fewer inhibitory connections 
to that particular long-axon cell than 
they do to other long-axon cells in the 
region. This is illustrated in an exag- 
gerated way in Fig. 2, where it is seen 
that when Cell A is firing it causes the
inhibition of its neighbors, B, C, and D,
but is not itself inhibited. In fact,
because the surrounding long-axon cells
cannot now be fired, there is no way in 
which the short-axon cells discharging 
on to A can be fired. Therefore, as long 
as A continues to fire, it protects itself 
from being inhibited. This is not the 
only mechanism, perhaps not even the
most likely mechanism, by which the
desired result could be achieved (repeti- 
tive firing during the hypersensitive 
phase of the recovery cycle would also 
tend to confine the activity to the group 
of neurons that were already firing), 
but it is a useful hypothesis on which 
to base subsequent discussion. 

Let us now see how this postulate will 
affect the equilibrium activity that can 
be sustained in our model of the cortex. 
There is a good chance that the cell 
which originated the activity will re- 
ceive an afferent impulse from some 
other cell which is fired later in the 
avalanche. If it does, there will be 
nothing to prevent its firing again in 
response to the returning impulse, be- 
cause its original discharge will have 
inhibited neighboring cells, thus greatly 
reducing the probability that it will be 
inhibited itself. When it fires again it 
will also be able to re-excite the ten 
cells that it fired previously, because 
they will have likewise remained free
from inhibition. Thus the whole group
will tend to re-excite itself, confining the 
activity to those neurons in which it first 
spread. If by some chance the original 
cell does not receive connections from 
any of the cells subsequently fired, some 
of the next ten cells, or those further 
along in the chain of activity, will do so, 
and the final result will be much the 
same. 

It now remains to see what prevents 
the activity from persisting in this group 
of cells indefinitely. A property of most 
cells in the nervous system is that if 
they are subjected to a constant source 
of excitation their frequency of dis- 
charge steadily falls. We may there- 
fore expect a decline in the intensity of 
firing in the group. As each cell adapts, 
it will deliver fewer impulses to other 
cells in the group, and reduced bombard- 
ment will combine with the adaptation 
to slow down the activity. On the other 
hand, there will be a concomitant de- 
crease of inhibitory firing, so that the 
over-all activity will not decline as 
rapidly as it otherwise would. How- 
ever, there is a limit to the compensa- 
tion that can be provided in this way, 
because when the inhibition becomes too 
small some of the cells outside the group, 
which were previously prevented from 
firing, will be released. The firing of 
these cells will then bring in fresh short- 
axon cells which will immediately in- 
hibit any fatigued cells in the vicinity. 
Thereupon, the balance of excitation 
and inhibition will be disturbed in re- 
gions to which the affected long-axon 
cells send impulses. If most of the cells 
in the group were already becoming 
adapted, or fatigued, the disturbance 
will snowball in reverse; each cell that 
stops firing causing connected cells to 
stop also. In this way the firing of one 
group might come to an abrupt end, and 
another group of cells take over. Note 
that although the activity will swing 
from one group of cells to another, it 
can never die out; long before that hap-
pened the level of inhibition would be-
come so low that all the unfatigued cells
would be able to fire. The self-main- 
tained cortical activity described above 
is, as the reader has no doubt realized, 
intended to form the basis of the revised 
version of the cell assembly. 

One behavioral phenomenon which 
Hebb explains in terms of the cell as- 
sembly is “set.” In order to do the 
same with the present model we shall 
have to add a further neural postulate. 
Perhaps the simplest type of behavior 
exemplifying “set” is that in which a 
stimulus, A, followed by another stimu- 
lus, B, gives rise to the response, C, al- 
though neither A nor B alone will pro- 
duce the response. Presentation of A 
induces a set to respond to B. In neural 
terms, it seems that after a cell assem- 
bly has ceased to exert a direct control 
on an animal’s behavior, it continues to 
influence the way in which that behav-
ior develops, perhaps for a period of
minutes. Only two categories of neural
process seem able to account for this 
long-lasting aftereffect of a cell assem- 
bly; either some neurons of the group 
representing Stimulus A, for example, 
continue to fire, though the main body 
of the activity is now taking place in 
cells of the group representing Stimulus 
B (this is the “active” trace hypothesis),
or the activity A leaves behind it physi- 
cal or chemical changes which last long 
enough to influence subsequent firing 
patterns (the “latent” trace hypothesis). 
Hebb attempts to explain “set” in terms 
of the first of these alternatives—the 
active trace hypothesis—and the same 
process might be applicable to the pres- 
ent model. However, the second mecha- 
nism seems to offer much more interest- 
ing possibilities, which we shall now 
proceed to explore. The hypothesis 
might be formally stated as follows: If 
afferent impulses fall on a cortical cell 
but do not fire it (because of inhibition 
or lack of adequate background facilita-
tion), they nevertheless leave the cell
with a lowered threshold, the effect dy- 
ing away with a time constant of many 
seconds. This influence of one cortical 
neuron on another will be called “prim- 
ing.” This postulate complicates the 
original model slightly because now, at 
the moments of transition from one pat- 
tern of cortical activity to another, we 
have to take account of the residual 
priming left by earlier activities, as well 
as of the momentary excitations. For 
example, a neuron that receives syn- 
aptic connections from neurons of sev- 
eral previously active groups will have 
a better chance of firing than a neigh- 
boring neuron that has been stimulated 
only by neurons of the most recently 
active group. 
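
The priming postulate can be sketched in a few lines. This is an illustration under assumptions, not a statement of the model's actual parameters: the exponential decay, the ten-second time constant, and the names `residual_priming` and `effective_threshold` are hypothetical. The text says only that the effect dies away with a time constant of many seconds.

```python
import math


def residual_priming(amount, elapsed_s, time_constant_s=10.0):
    """Priming left over `elapsed_s` seconds after a subthreshold impulse.

    Exponential decay is an assumption made for this sketch.
    """
    return amount * math.exp(-elapsed_s / time_constant_s)


def effective_threshold(base, priming_events, now_s, time_constant_s=10.0):
    """Threshold of a cell after subtracting all residual priming.

    `priming_events` is a list of (time_s, amount) pairs, one for each
    occasion on which afferent impulses reached the cell without firing it.
    """
    total = sum(
        residual_priming(amount, now_s - t, time_constant_s)
        for t, amount in priming_events
        if t <= now_s
    )
    return base - total


if __name__ == "__main__":
    # A cell reached by three successively active groups, compared with
    # a neighbor reached only by the most recently active group.
    several_groups = [(0.0, 1.0), (5.0, 1.0), (10.0, 1.0)]
    latest_group_only = [(10.0, 1.0)]
    print(effective_threshold(5.0, several_groups, now_s=10.0))
    print(effective_threshold(5.0, latest_group_only, now_s=10.0))
```

The cell reached by several previously active groups ends with the lower threshold, which corresponds to the "better chance of firing" described in the example above.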

We must now examine the effect of 
sensory afferents on this postulated cor- 
tical system. The sensory projection 
areas of the cortex are regarded for this 
purpose as the final distribution centers 
for sensory impulses. The long-axon 
cells with specific-sensory afferent con- 
nections are assumed to have wide- 
spread efferent connections to the rest 
of the cortex, so that a change in the 
pattern of sensory input will result in 
multiple changes of the local fields of 
excitation there. If the cortex is already 
intensely active, and if the sensory 
change is a slight one, the added excita- 
tion will be unable to affect the ongoing 
pattern of cortical firing. The cells 
which are not already firing will be too 
strongly inhibited by those that are fir- 
ing to be influenced by the input. Be- 
haviorally, this means that the animal 
will ignore the stimulus and pursue its 
former course. On the other hand, if 
the stimulus is a strong one, or if the 
cortex is relatively inactive when it is 
presented, the sensory impulses will de- 
termine, or significantly influence, the 
pattern of the next activity in the cortex. 

The learning mechanism can now be
made explicit. It has been postulated
that if a neuron helps to fire another 
neuron on several occasions, it becomes 
a more potent agent for firing that 
neuron subsequently. This clearly im- 
plies that the neurons of a group that 
fire one another, in the way that has 
been described, will tend to bind them- 
selves together more closely. Let us 
assume that one group has been acti- 
vated on several occasions, and that 
consequently each neuron in the group 
has strengthened its synaptic connec- 
tions with one or more other cells in the 
group, but not with any of the nonfiring 
cells to which its connections were ini- 
tially equally strong. At this stage, if 
a neuron in the group is fired, the likeli- 
hood of its firing other cells of the group 
will have been increased; and the more 
cells of the group that are so aroused, 
the more the remaining members will 
be subjected to preferential excitation. 
This selective effect will often cause the 
total group activity to be established, 
starting from the firing of only a frac- 
tion of the constituent neurons. More- 
over, each time this concerted firing 
takes place, the connections will be fur- 
ther strengthened, and the probability 
of the group’s firing as a unit will 
become still greater. This process is 
equivalent to Hebb’s mechanism for 
building up a cell assembly by repeated 
presentation of a “perceptual element”; 
its significance for the explanation of 
perceptual learning and concept forma- 
tion has been discussed in some detail 
by Hebb (5, 6) and need not be re- 
peated here. It must not be supposed, 
however, that after a stimulus has been 
presented often enough it will become 
able to determine the total pattern of 
the assembly; there will always be an 
inconstant “fringe” of cells that fire be- 
cause of some previous cortical activity, 
metabolic conditions, or other factors 
independent of the stimulus. This fringe 
plays an important role in the associa-
tion of one cell assembly with another,
as we shall now see. 
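
The binding of a group by repeated joint firing, and its later arousal from a fraction of its members, can be sketched as follows. The sketch rests on assumptions that are not in the text: a fixed increment per joint firing, a simple summed-input threshold, and the hypothetical names `make_weights`, `train`, and `recruit`. Inhibition and the inconstant fringe are left out for brevity.

```python
def make_weights(n_cells, initial=0.1):
    """Connection strengths, initially equal between all pairs of cells."""
    return [
        [0.0 if i == j else initial for j in range(n_cells)]
        for i in range(n_cells)
    ]


def train(weights, group, rate=0.2, repetitions=5):
    """Strengthen connections among cells that repeatedly fire together."""
    members = list(group)
    for _ in range(repetitions):
        for i in members:
            for j in members:
                if i != j:
                    # A cell that helps to fire another becomes a more
                    # potent agent for firing it subsequently.
                    weights[i][j] += rate
    return weights


def recruit(weights, seed, threshold=1.0, max_rounds=10):
    """Spread firing from `seed` to every cell whose summed input from
    the currently active cells reaches `threshold`."""
    active = set(seed)
    for _ in range(max_rounds):
        newly_fired = {
            j
            for j in range(len(weights))
            if j not in active
            and sum(weights[i][j] for i in active) >= threshold
        }
        if not newly_fired:
            break
        active |= newly_fired
    return active


if __name__ == "__main__":
    weights = make_weights(12)
    print(sorted(recruit(weights, {0, 1})))  # before any joint firing
    train(weights, range(6))
    print(sorted(recruit(weights, {0, 1})))  # after repeated joint firing
```

Before training, firing two cells recruits nothing further; after training, the same two cells arouse the rest of their group while the untrained cells stay silent.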

Associative learning was mentioned 
earlier as being the linking together of 
cell assemblies into phase sequences.
This will usually occur because the as- 
semblies in question have been aroused 
contiguously. It has been assumed up 
to now that the neurons that initiate an 
assembly activity are themselves fired 
by a sensory event, but clearly once a 
group of cells has strengthened its in- 
terconnections sufficiently, it does not 
matter which of its elements are fired 
first; the activity will in any case spread 
to the rest. If enough cells of one as- 
sembly are left in a primed or excited 
state by firing in other cell assemblies, 
that assembly may burst into activity 
when a previous one fades out. The 
question we have to answer is how such 
an interrelationship between two assem- 
blies could be established by experience. 

Let us suppose that two stimuli, giv- 
ing rise to Cell Assemblies A and B, re- 
spectively, are presented in succession a 
number of times. Under these condi- 
tions some of the fringe neurons of As- 
sembly B will be determined by A, and 
so will not be random any more. In 
other words, some of the cells primed 
by A will be active with Assembly B on 
every occasion that the stimulus pair is 
presented, and these cells will start to 
acquire stronger connections with the 
cells of the B assembly. Before long, 
in fact, they will become members of 
the assembly. When this has happened 
it will be possible for these cells, which 
on the first presentation of the stimulus 
pair were merely a part of the random 
fringe of B, to arouse the whole B cell 
assembly in the same way that any 
other fractional firing of the assembly 
would arouse it. It remains only to 
point out that these particular cells are 
the ones that are so strongly primed by 
the activity of the A assembly that they 
have a high probability of firing when 
it stops (which is how they became
part of the firing of B in the first place), 
and the method by which two assemblies 
become associated will be evident. Of 
the cells released from inhibition when 
Assembly A dies away, those that have 
on previous occasions fired with (and 
thus acquired strong connections with) 
cells of the B assembly, will be able to 
recruit the rest of the cells of that as- 
sembly. On the other hand, the suc- 
cessors of Assembly A which are incom- 
patible with firing in the B assembly 
will have no strengthened connections, 
and will be promptly suppressed as the 
rest of the B assembly begins to fire. 
It may be observed that the amount 
of perceptual “overlearning” that has 
gone into the development of the assem- 
bly, before its association with another, 
will make a great deal of difference 
to the ease with which the linking is 
achieved. If the total assembly action 
can spring from the firing of, say, five 
per cent of its constituent cells, links 
will have to be strengthened to a smaller 
number of cells than would be neces- 
sary if twenty per cent of the cells had 
to be fired to insure assembly action. 
That is, if the intragroup connections 
are weak to start with, the new cells will 
have to acquire stronger and more nu- 
merous connections to the rest of the 
group in order to compensate for the 
deficiency. Another point to be remem- 
bered is that only rarely will a single 
cell assembly be able to fire another one 
unaided, as in the example given. In 
most cases, the assembly aroused by as- 
sociation will have been primed by many 
previously active assemblies before it 
builds up enough sensitized cells to en- 
able it to fire without its proper sen- 
sory input. This is an explanation of 
the importance of context, and set, for 
association; the arousal of a particular 
association may depend on the surround- 
ings being similar to those in which the 
original learning took place, because the 
firing of a number of cell assemblies by
the environmental stimuli is necessary 
to build up priming in the required 
group. 

The model described above represents 
a brain mechanism capable of associat- 
ing contiguously presented stimuli. This 
is only part of the apparatus needed to 
explain an animal’s behavior in a learn- 
ing situation, but it is an essential part. 
The most important other mechanism 
has to do with motivation. In this pa- 
per motivation will be considered to in- 
fluence learned behavior in two ways: 
(a) by affecting the rate of learning, 
and (b) by helping to evoke responses.

In order to discuss motivation from a 
physiological standpoint, we must return 
to the system which was mentioned 
briefly before, the nonspecific projection 
system. It will be remembered that our 
model of the functional organization of 
the cortex was developed on the assump- 
tion of a fixed level of facilitatory bom- 
bardment from this system, such that 
one long-axon cortical cell was able to 
fire ten others in the absence of inhibi- 
tion. If in the illustration we had as- 
sumed a lower level of nonspecific ac- 
tivity, the total cortical firing would 
have been less, though not proportion- 
ally so because of the accompanying de- 
crease in inhibitory firing. It might be 
remarked, in passing, that a number of 
loose ends may be rather neatly tied to- 
gether if it is assumed that impulses 
reaching a long-axon cell in the cortex 
from other cortical cells do not actually 
fire that cell, but induce a long-lasting 
depolarization that sensitizes it to bom- 
barding impulses from the arousal sys- 
tem. Thus the cell would be primed, 
but not fired, by other cortical firing,
and then fired by the next few impulses 
that it receives from the nonspecific 
arousal system. Such a mechanism 
would be consistent with the observa- 
tion of Li and Jasper (9) that the rate 
of cortical firing increases with the level 
of arousal; it would also be in agree-
ment with the data of Clare and Bishop
(2) on the long-lasting dendritic de- 
polarization set up by association affer- 
ents. Moreover, the work of Bradley 
(1) might be interpreted as indicating 
that the arousal system afferents are 
cholinergic, and as acetylcholine is de- 
stroyed instantly on being released it is 
unlikely that the arousal system im- 
pulses would give rise to any long-last- 
ing effect in the cortex. 

Lindsley (10) has indicated that there 
is a rough correlation between the mo- 
tivational and emotional states of an 
animal and the level of activity of its 
arousal system; and Sharpless (12) has 
shown that novel stimuli are more po- 
tent stimulators of the arousal system 
than are familiar ones. It is also gen-
erally true that a hungry or otherwise
uncomfortable animal has an increased
level of arousal activity, especially in
the presence of a satisfier. It thus seems
that conditions which increase firing in
the nonspecific system are those which 
tend to promote learning. There is still 
dispute as to how essential reinforce- 
ment might be for learning, but there is 
little doubt that some sort of “empha- 
sis” on the relevant stimuli accelerates 
the learning process. In terms of our 
model, this means that cortical cells are 
linked together more rapidly when the 
rate of cortical activity is higher—a 
most reasonable conclusion. It might 
also be deduced from the model that
some association should be possible as 
long as any cortical cells are firing (dur- 
ing dreams, for example). The rein- 
forcement controversy thus reduces to 
the question of how much firing is going 
on in the cell assemblies concerned, re- 
inforcing agents serving to increase that 
firing through their effects on the non- 
specific projection system. According to 
this theory, therefore, the arousing rather 
than the need-reducing properties of the 
reinforcement are those important for 
learning. Evolution would account for
the fact that, for most present-day ani- 
mals, need reducers and dangerous situa- 
tions tend to be particularly arousing. 
It should be noted that, in the exam- 
ples given, it has not been a response 
but rather a relationship between stimuli 
that has been learned. It is not too 
difficult to extend the model to account 
also for the association between stimuli 
and responses, but we then have to face 
the problem of why the stimulus is 
sometimes followed by the response and 
at other times not. No doubt this is 
sometimes due to the failure of effective 
association between the cell assemblies 
concerned, but we have introspective 
evidence which suggests that we may 
know what to do to achieve a goal, but 
not be sufficiently interested to perform 
the actions. This brings us to the prob- 
lem of the effect of motivation on per- 
formance. Unfortunately, we cannot do 
justice to this topic in the space of this 
paper, but a brief indication will be 
given as to how the model might be 
able to handle it. We must assume that 
many motor neurons will be fired along 
with the neurons of the cell assemblies 
we have described, but because they 
must compete with postural and other 
reflexes for control of the musculature, 
they can only produce a response if 
they fire at a high enough frequency. 
Below some threshold value, cortical fir- 
ing has no effect on overt behavior. It 
is suggested that the arousal system is 
responsible for raising cortical activity 
above this threshold when the stimu- 
lus conditions are sufficiently exciting. 
(However, as Hebb has suggested [7], 
if the firing becomes too vigorous, some 
other factor may intervene to prevent 
further responses.) In the case of mo- 
tivation set up by expectancy of reward 
or punishment, it must be assumed that 
the arousal system has been excited, not 
by the stimulus actually present, but by 
a cell assembly representing the moti-
vating stimulus which, through previous
experience in the situation, had become 
associated with the cell assembly for the 
actual stimulus. The cell assembly for 
the motivating stimulus will be able to 
fire the arousal system because of an as- 
sociation built up through innumerable 
presentations of the motivating stimu- 
lus, under conditions where it fired both 
the cortical and the subcortical systems 
simultaneously. Thus the environmen- 
tal stimuli fire their cell assemblies, 
these arouse the assembly of the moti- 
vating stimulus by association, and this 
in turn fires the arousal system, also by 
association. 

An example might make the proposi- 
tion clearer. When a rat is first placed 
in a maze, the arousal value of the novel 
surroundings will be sufficient to raise 
the level of cortical firing so that more 
or less randomly associated “motor” cell 
assemblies will influence overt behavior; 
the rat will explore. If no incentive is 
introduced, the effect of novelty will 
wear off and the rat will lie down and 
doze. However, if the rat is food-de- 
prived, and food is placed in the maze, 
the interaction of the stimuli from the 
food with the cells in the rat’s nervous 
system that are sensitized by depriva- 
tion will produce persistent excitation of 
the arousal system. Moreover, cell as- 
semblies fired by the food stimuli (if 
the food is familiar) will have strong 
connections with the cells in the arousal 
system, so that even if the deprivation is 
not severe the presence of food will be 
likely to produce arousal. These cell as- 
semblies for food stimuli will, of course, 
become associated with any other cell 
assemblies active at about the same time 
(those representing the sight of the food 
dish, or the feel of the maze floor, for 
example). Therefore, when on a subse-
quent occasion the hungry rat is re-
turned to the maze, one of the assem- 
blies fired by the stimuli in the starting 
box may be able to excite, by associa-
tion, the assembly which represents the
food, and thus, indirectly, the arousal 
system. The firing in the latter will not 
die out as soon as the cell assembly 
which aroused it ceases to fire; activity 
in the arousal system does not stop ab- 
ruptly (possibly because its activity re- 
leases adrenalin into the blood stream). 
It will therefore continue to bombard 
the cortex at an increased rate for per- 
haps several minutes, and the motor 
component of any cell assemblies occur- 
ring during that time will fire intensely 
enough to produce a response. If, for 
example, a running response has been 
associated with the starting-box stimuli, 
the rat will actually run; if the same 
cell assembly (for running) had become 
active before the arousal system had 
been fired by the association between the 
starting-box stimuli and food, then the 
cortical firing would probably not have 
been intense enough to elicit overt move- 
ments. The same sluggishness would be 
expected if the animal was sated, so that 
the cells of the arousal system were not 
sensitive to food stimuli or their asso- 
ciated cell assemblies. 
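
The maze example can be reduced to a toy calculation. Everything quantitative here is assumed for illustration: the multiplicative combination of assembly drive and arousal, the two-minute decay, the threshold value, and the names `arousal_level` and `response_occurs` are hypothetical and do not come from the text, which states only that arousal outlasts its trigger by perhaps several minutes and that cortical firing below some threshold has no effect on overt behavior.

```python
import math


def arousal_level(t_s, trigger_s, baseline=0.2, boost=1.0, decay_s=120.0):
    """Arousal at time `t_s`, given that the arousal system was fired at
    `trigger_s` (None if it was never fired, as in a sated animal).

    The slow exponential decay stands in for the observation that
    arousal does not stop abruptly; its form is an assumption.
    """
    if trigger_s is None or t_s < trigger_s:
        return baseline
    return baseline + boost * math.exp(-(t_s - trigger_s) / decay_s)


def response_occurs(assembly_drive, arousal, motor_threshold=0.5):
    """True if the motor component fires strongly enough to win control
    of the musculature (assumed: drive scaled by arousal)."""
    return assembly_drive * arousal >= motor_threshold


if __name__ == "__main__":
    # Running assembly becomes active 30 s after the food assembly
    # has fired the arousal system.
    print(response_occurs(1.0, arousal_level(30.0, trigger_s=0.0)))
    # Same assembly active before the arousal system has been fired.
    print(response_occurs(1.0, arousal_level(-5.0, trigger_s=0.0)))
    # Sated animal: the arousal system is never fired.
    print(response_occurs(1.0, arousal_level(30.0, trigger_s=None)))
```

Only the first case yields an overt response; the other two correspond to the sluggishness described above.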

The above example is admittedly su-
perficial. More complicated situations,
such as avoidance learning, or extinc- 
tion, cannot be explained without intro- 
ducing still more postulates, in particu- 
lar a system for inhibiting movements. 
The value of such a model would be pri- 
marily to demonstrate that it is possible 
to design a machine that would not only 
“want” or “avoid” arbitrarily designated 
stimuli, but also learn what to do about 
them under all environmental conditions 
to which it was sensitive. Whether the 
design would bear any relationship to 
the actual mechanism of animals which 
can also do these things could be known 
only after further research. 


SUMMARY 


A neural model has been presented, 
based on an “association-of-ideas” para-
digm of learning, similar to that used by
Hebb. In it groups of neurons (cell as- 
semblies) become representors of stimuli, 
and can then be linked together by be- 
ing fired contiguously. The model dif- 
fers from Hebb’s in that an inhibitory 
regulatory system is postulated which 
limits (to a minute fraction of the to- 
tal) the number of cortical neurons that 
can fire simultaneously, and insures that 
those firing are dispersed as widely as 
possible. A further change is introduced 
to meet the paradox that cell assemblies 
can be associated with one another with- 
out losing their individuality and being 
submerged in a composite new cell as- 
sembly. In association, it is not the 
cells of one assembly that acquire con- 
nections with the cells of another; in- 
stead, cells primed, or sensitized, by the 
first assembly become incorporated into 
the second. Thus, one principle of 
learning—the binding of cells into a 
group by repeated simultaneous firing— 
fulfills a double role; when the newly 
added cells are predominantly primed 
by sensory input, perceptual learning re-
sults; and when the new cells are primed
by the firing of another cell assembly, 
associative learning results. 

Because the effect of priming lasts for 
many seconds, it is possible for a cell 
assembly to accumulate the sensitiza- 
tions induced by the activities of a num- 
ber of associated assemblies, and so in- 
crease the probability that it will itself 
fire. 

The dual role of motivation—the fa- 
cilitation of learning and the elicitation 
of responses—has been discussed in 
terms of the arousal effects of the non-
specific projection system on the postu-
lated cortical network.

The purpose of this paper is to dis-
cuss two related topics: (a) the role of
a person’s level of aspiration in his de- 
cision making, and (b) the measurement 
of level of aspiration in a decision-the- 
ory context. The discussion of the sec- 
ond topic will include a brief summary 
of some experimental evidence which 
lends support to the methodological 
stand taken here. 

The notion of level of aspiration is 
invoked in reference to the goal-striving 
behavior of an individual when he is 
presented with a task whose outcome 
can be measured on an achievement 
scale. Level of aspiration refers to the 
particular achievement goal for which 
the person strives. 

The concept of level of aspiration was 
introduced by Dembo (8), and the first 
experiment in the area was conducted 
by Hoppe (14). It is a familiar con- 
cept to psychologists and others, having 
been the topic of extensive discussion 
and experimentation in the last quarter 
of a century. An early review of the 
literature is given by Frank (13). Rot- 
ter has offered a critical review of the 
methodological aspects of level of as- 
piration studies (19). An exceptionally 
important theoretical article, by Lewin, 
Dembo, Festinger, and Sears, appeared 
in 1944 (16). 


1 This paper is based on research conducted 
under Research Grant M-1328 from the Na- 
tional Institute of Mental Health and under a 
grant from the Council on Research of The 
Pennsylvania State University. The author is 
indebted to Selwyn W. Becker, Danuta M. 
Hiz, and Henry T. Hiz for discussing with him 
many of the ideas presented here. 

2 On leave from the Pennsylvania State Uni- 
versity, 1957-1958 year. 


DECISION THEORY AND LEVEL OF 
ASPIRATION THEORY 


It is the contention of the present 
writer that the psychological situation 
established in level of aspiration experi- 
ments may profitably be characterized 
as a decision situation, for from the al- 
ternative possible goals the individual 
must decide for which goal he will 
strive. 

It is a remarkable fact that, by a 
simple change in nomenclature, the theo- 
retical model used by Lewin et al. (16) 
in the prediction of the choices (deci- 
sions) of individuals in a goal-striving 
situation—a model based on the work 
of Escalona (11) and Festinger (12)— 
may be rendered fundamentally equiva- 
lent to the theoretical model employed 
by decision and game theorists. This 
latter theory, first advanced by Bernoulli 
(2), discussed by Ramsey (18), and 
formalized by Von Neumann and Mor- 
genstern (28) and by Savage (20), 
states that under conditions of uncer- 
tainty individuals behave as if they were 
attempting to maximize expected utility. 
According to these and other decision 
theorists, an individual’s decisions un- 
derlying his choices among alternatives 
involving uncertain outcomes (outcomes 
with stated probabilities of attainment) 
are based on the utilities of the entities 
(objects, actions, goals, etc.) and on the 
probabilities (subjective probabilities, 
for most decision theorists) associated 
with attainment of the entities. The 
decisions are a function of these two 
variables (utility and subjective prob- 
ability) in that the individual seeks by 
his choices to maximize the sum of the 
products of probability and utility, i.e., 
he acts so as to maximize

$$\mathrm{SEU} = \sum_i p_i u_i,$$

where SEU is subjectively expected utility
(cf. [10]).
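
The maximization rule just stated can be sketched in a few lines of Python. This is only an illustration: the names `seu` and `choose` and the numerical utilities are hypothetical and do not come from the sources cited.

```python
from typing import Dict, List, Tuple

# An alternative is a list of (subjective probability, utility) pairs,
# one pair per possible outcome.
Alternative = List[Tuple[float, float]]


def seu(alternative: Alternative) -> float:
    """Subjectively expected utility: the sum of p_i * u_i over outcomes."""
    return sum(p * u for p, u in alternative)


def choose(alternatives: Dict[str, Alternative]) -> str:
    """Return the label of the alternative with the largest SEU."""
    return max(alternatives, key=lambda name: seu(alternatives[name]))


# Hypothetical utilities, chosen only for illustration.
options = {
    "a": [(0.5, 10.0), (0.5, -4.0)],  # SEU = 3.0
    "b": [(1.0, 2.0)],                # SEU = 2.0
}
print(choose(options))  # a
```

The sketch assumes that the probabilities within each alternative sum to one; it does not check this.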

Equivalence of terms. Lewin et al. 
assert that the level of aspiration set by 
the person is a function of three factors:
(a) “the seeking of success,” (b)
“the avoiding of failure,” and (c) the
“cognitive factor of a probability judgment”
(16, p. 376). The last-mentioned
factor refers to the individual’s
subjective probability of achieving a 
goal. (Lewin et al. use the term “sub- 
jective probability.” See [16] p. 361 
et seq.) Each of these three concepts 
of Lewinian theory may be “translated” 
directly to an equivalent concept in de- 
cision theory. The achievement scale 
may be thought of as a utility scale, 
where utility refers to the subjective 
value of an entity (goal, action, object) 
as distinguished from its objective or 
“face” value. It is the contention of 
this paper that each level on an achieve- 
ment scale has a certain utility for a 
person, and that therefore level of as- 
piration may be conceived as a “point” 
on a scale of utility. An achievement 
which represents “success” in Lewinian 
terms may be thought of in decision the- 
ory terms as one which has “positive 
utility.” An achievement which repre- 
sents “failure” to the person may be 
thought of as having “negative utility” 
for him. And the Lewinian notion of 
“subjective probability” needs no trans- 
lation, for the same term is used in de- 
cision theory. 

The demonstration that certain con- 
cepts in Lewinian level of aspiration the- 
ory are equivalent to certain key con- 
cepts in decision theory is important be- 
cause of the consequences it may have. 
Because of these equivalences, two im- 
portant bodies of research literature may 
be synthesized, with benefit to both.
The benefit to traditional psychologists, 
accustomed to dealing with terms like 
“success” and “failure,” will accrue because
of the much greater formality and
rigor characteristic of decision theory, a 
rigor unknown to many psychologists. 
The benefit to decision theory will come 
because the synthesis will render avail- 
able to decision theorists a large amount 
of existing experimental evidence with 
relevance to their work which has not 
yet been recognized by them. 

Equivalence of formal statements of 
relations. The similarity between the 
two theories does not rest with equiva- 
lence of their variables. Also compa- 
rable are the formal statements of the 
relations among these variables. 

For example, Lewin et al. (16, p. 360)
state that:


The individual faces the possibility of succeeding
or failing and the positive or negative valence of
such a future success or failure on the various
levels is one of the basic elements for the decision.
To determine the valence (Va) of each level (n)
of activity [$Va(A^n)$] we have to consider the
negative valence that future failure has on that
level [$Va(\mathrm{Fai}\ A^n)$] and the positive valence of
success on that level [$Va(\mathrm{Suc}\ A^n)$].

$$Va(A^n) = Va(\mathrm{Suc}\ A^n) + Va(\mathrm{Fai}\ A^n).$$

When discussing the individual’s felt
probability of success (16, p. 362), they
also state that:


The most simple assumption is that this
“weighted” valence of success [${}^{\circ}Va(\mathrm{Suc}\ A^n)$]
is the product of the valence and the probability
of success

$${}^{\circ}Va(\mathrm{Suc}\ A^n) = Va(\mathrm{Suc}\ A^n) \cdot \mathrm{Prob}(\mathrm{Suc}\ A^n).$$

The corresponding formula for failure is:

$${}^{\circ}Va(\mathrm{Fai}\ A^n) = Va(\mathrm{Fai}\ A^n) \cdot \mathrm{Prob}(\mathrm{Fai}\ A^n).$$


If Lewin et al. had proceeded to the logical
conclusion of these ideas, they could
have stated that

$$\text{Expected } Va(A^n) = \mathrm{Prob}(\mathrm{Suc}\ A^n) \cdot Va(\mathrm{Suc}\ A^n) + [1 - \mathrm{Prob}(\mathrm{Suc}\ A^n)] \cdot Va(\mathrm{Fai}\ A^n),$$

which in decision theory terms is given as

$$\text{Expected Utility} = \text{Probability} \cdot \text{Utility of Success} + (1 - \text{Probability}) \cdot \text{Utility of Failure},$$


in which it is asserted that the indi- 
vidual chooses among alternatives (some- 
times called “courses of action”) so as 
to maximize expected utility. 

This formula is equivalent to that 
given by Von Neumann and Morgen- 
stern (28), and has been utilized in 
studies of decision behavior by David- 
son, Siegel, and Suppes (7), Mosteller 
and Nogee (17), Siegel (21), Hurst 
and Siegel (15), Edwards (10), and 
others (see [9]). 

In the discussion of Lewin et al., the 
problem of obtaining a rigorous meas- 
urement of utility (degree of valence) 
and of subjective probability is virtu- 
ally disregarded, despite the fact that 
these are the variables upon which the 
entire structure of the model depends. 
In the literature of decision theory, on 
the other hand, attention has been given 
to this important facet. See, for exam- 
ple, the experiments by Mosteller and 
Nogee (17) and by Davidson, Siegel, 
and Suppes (7) on the measurement of 
utility in an interval scale and on the 
measurement of subjective probability, 
and also those of Siegel (21) and 
Coombs and Komorita (6) on the meas- 
urement of utility on an ordered metric 
scale. 

Another contrast between the two 
theories rests on the fact that in decision 
theory the formal theoretical models are 
based on a set of assertions from which 
consequences can be derived by logico- 
mathematical arguments. The asser- 
tions are referred to as axioms of the 
system (cf. [20]). If the model is used 
to interpret “real world” phenomena, it 
is possible to know, without ambiguity, 
what aspect of the model is analogous 
to what thing (or action) in the real 
world. 
When such an abstract and formal
system is available, it sometimes hap- 
pens that the mathematical system will 
serve as a model for more than one 
situation in the real world. This fortui- 
tous event may reveal analogies or re- 
lations of whose existence scientists were 
previously unaware. This seems to be 
the case in the present instance. A for- 
mal theoretical model has been con- 
structed which is useful in interpreting 
decision behavior in general. It is con- 
tended in this paper that this model will 
encompass decision behavior in goal- 
striving or achievement situations, i.e., 
in situations where level of aspiration is 
an important variable of behavior, and 
furthermore that level of aspiration may 
usefully be defined in decision-theory 
terms and measured in a decision-theory 
context. 


LEVEL OF ASPIRATION AS A DETER- 
MINANT IN DECISION MAKING 


Suppose an individual is given the 
task of finding as many as he can of 
five hidden objects, which will be his to 
keep. Suppose further that this indi- 
vidual aspires to find and thereby ac- 
quire at least four of these. That is, 
the acquisition of four or more of the 
objects is defined as “success” by this 
individual, and the acquisition of any 
fewer will constitute “failure” for him. 
Now suppose that before he attempts to 
find the objects, he is asked to choose 
between the following alternatives: (a) 
a 50-50 chance of getting either four of 
the objects or none of the objects, or 
(b) assurance of getting three of the objects.
That is, suppose that the outcome
of the task is completely determined
by the individual’s choosing either
of the alternatives above and then by 
the toss of a “fair” coin. If the indi- 
vidual chooses Alternative (a) above, 
and if the coin lands “heads” he gets 
four objects, whereas if it lands “tails” 
he gets none at all. On the other hand, 
if he chooses (b) he gets three objects
whether the coin lands heads or tails.

Now since the individual has set the 
acquisition of four objects as his level 
of aspiration, and the acquisition of any 
fewer will be considered as a “failure” 
by him, it seems clear that he will choose 
Alternative (a), despite the fact that 
objectively Alternative (b) has greater 
expected value. In other words, Al- 
ternative (b) may have greater expected 
value, but Alternative (a) has greater 
expected utility for this person. His 
level of aspiration has made the utility 
of the objects nonlinear with their ob- 
jective value. 
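
The contrast between expected value and expected utility in this example can be checked numerically. The sketch below assumes a purely hypothetical utility scale for a person whose level of aspiration is four objects; the particular numbers are invented for illustration and are not taken from any study.

```python
# Hypothetical utility scale for a person whose level of aspiration is 4:
# fewer than four objects is a "failure" (negative utility),
# four or more is a "success" (positive utility).
utility = {0: -10.0, 1: -9.0, 2: -8.0, 3: -7.0, 4: 5.0, 5: 6.0}

# Each alternative is a list of (probability, number of objects) pairs.
alt_a = [(0.5, 4), (0.5, 0)]  # fair coin: four objects or none
alt_b = [(1.0, 3)]            # three objects for certain


def expected_value(alt):
    """Objective expectation of the number of objects."""
    return sum(p * x for p, x in alt)


def expected_utility(alt):
    """Expectation of the (hypothetical) utility of the objects."""
    return sum(p * utility[x] for p, x in alt)


print(expected_value(alt_a), expected_value(alt_b))      # 2.0 3.0
print(expected_utility(alt_a), expected_utility(alt_b))  # -2.5 -7.0
```

With these assumed utilities Alternative (b) has the larger expected value, yet Alternative (a) has the larger expected utility, which is the pattern described in the text.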

In game-theory terms, the above situa- 
tion may be described as a one-person 
game in which the subject chooses be- 
tween two alternatives, each of which is 
a probability-combination of two out- 
comes. The format for the game is: 


                   Alternative (a)    Alternative (b)

If heads occurs    you get 4          you get 3
If tails occurs    you get 0          you get 3


The subject chooses the column; the
outcome of the toss of the coin determines
the row.

Now if the subject has Level of As- 
piration 4, and if therefore he chooses 
Alternative (a) over (b), this tells us
that for this individual the difference in 
utility between 4 and 3 objects is greater 
than the difference between 3 and 0. 

That conclusion is based on this reasoning: if

$$\text{Expected Utility of (a)} > \text{Expected Utility of (b)}, \tag{1}$$

where u(x) is read as “the utility of acquiring
x objects” and interpreted to
mean the subjective value of x, i.e., its
worth to the person, then Statement 1
can be written

$$p \cdot u(4) + (1 - p) \cdot u(0) > p \cdot u(3) + (1 - p) \cdot u(3). \tag{2}$$
If p is understood to be subjective probability,
and is known to be one-half,
then Statement 2 may be written

$$u(4) + u(0) > u(3) + u(3) \tag{3}$$

and

$$u(4) - u(3) > u(3) - u(0). \tag{4}$$


That is, when Alternative (a) is pre- 
ferred, we may infer that Statement 4 
above is true, i.e., that the difference in 
utility between 4 and 3 objects is larger 
than the difference in utility between 3 
and 0 (21, pp. 212-213). 
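
The inference from a single choice to an ordering of two utility differences can be written as a small routine. The sketch below is illustrative only: the function name `infer_from_choice` is hypothetical, and it assumes a subjective probability of one-half and a strict preference, as in Statements 2 to 4.

```python
def infer_from_choice(high, middle, low, chose_gamble):
    """Order two adjacent utility intervals from one 50-50 choice.

    The subject chooses between a 50-50 gamble on `high` or `low` and
    `middle` for certain. Returns (larger_interval, smaller_interval),
    each interval given as a pair of outcomes (upper, lower).
    """
    upper = (high, middle)  # stands for u(high) - u(middle)
    lower = (middle, low)   # stands for u(middle) - u(low)
    # Preferring the gamble means u(high) + u(low) > 2 * u(middle),
    # i.e. the upper interval exceeds the lower one; preferring the
    # sure thing means the reverse.
    return (upper, lower) if chose_gamble else (lower, upper)


# The hidden-objects example: the gamble on 4 or 0 is preferred to a sure 3.
print(infer_from_choice(4, 3, 0, chose_gamble=True))  # ((4, 3), (3, 0))
```

Repeating such comparisons over the available outcomes is what allows the distances between them to be ranked.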

We can see from the above that for 
this individual the acquisition of four or 
more objects has positive utility. The 
acquisition of three or fewer has nega- 
tive utility. Zero utility is represented 
by some point between three and four. 
It should also be noticed that the level 
of aspiration is associated with the 
largest difference on the utility scale. 

We may cite another example to illus- 
trate the same points. Suppose a stu- 
dent is enrolled in a course in which he 
aspires for the grade of B. Before the 
examination on which the course grade 
is based, he is asked to choose between 
the following alternatives: 


                         Alternative (a)    Alternative (b)

If Event E occurs        you get an A       you get a B
If Event Non-E occurs    you get a C        you get a B


If the individual believes that the prob- 
ability of the occurrence of Event E is 
.50, it seems clear that he will choose 
Alternative (b), since his level of aspiration
is to achieve a grade of B. By this
choice, the individual demonstrates to us 
that the difference in utility between B 
and C is greater than the difference in 
utility between A and B. That is, on 
his utility scale B is closer to A than to 
C. See Fig. 1. 

FIG. 1.

Now the same person is asked to 
choose between these: 


                         Alternative (a)    Alternative (b)

If Event E occurs        you get a B        you get a C
If Event Non-E occurs    you get a D        you get a C


If his subjective probability toward 
Event E is .50, he will choose Alterna- 
tive (a), for in terms of his level of 
aspiration any grade less than B is a 
“failure.” His selection of Alternative 
(a) demonstrates that for him the dif- 
ference in utility between Grades B and 
C is larger than that between C and D, 
as shown in Fig. 2. Again we see that 
the level of aspiration is associated with 
the upper bound of the largest distance 
on the utility scale. 


LEVEL OF ASPIRATION DEFINED 


Before we offer a formal definition of 
the level of aspiration, we may summa- 
rize what has been said above concern- 
ing it: 

1. The level of aspiration of an indi- 
vidual is a position on his utility scale 
of an achievement variable. 

2. The lower bound of that position 
is neutral in utility, i.e., all points below
the lower bound have negative utility
(psychological feeling of dissatisfaction)
associated with them and all points
above the lower bound have positive 
utility (psychological feeling of satis- 
faction) associated with them. 

3. The level of aspiration is associ- 
ated with the upper bound of the largest 
distance on the utility scale. 


[FIG. 2. Scale marking the grades B, C, and D.]

Each of the characteristics of level of
aspiration enumerated above is incorpo- 
rated in the following formal and more 
rigorous definition of level of aspiration: 

The level of aspiration of an indi- 
vidual is a point in the positive region 
of his utility scale of an achievement 
variable; it is at the least upper bound 
of that chord (connecting two goals) 
which has maximum slope; i.e., the level 
of aspiration is associated with the higher 
of the two goals between which the rate 
of change of the utility function is a 
maximum. 
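
For a discrete, ordered set of goals the definition can be applied mechanically. The sketch below is a minimal illustration, assuming that adjacent goals are treated as equally spaced on the achievement scale, so that the chord of maximum slope is simply the adjacent pair with the largest utility difference. The function name and the utility values are hypothetical.

```python
def level_of_aspiration(goals, utility):
    """Return the higher goal of the adjacent pair of goals with the
    largest utility difference.

    goals:   goals ordered from lowest to highest achievement
    utility: mapping from each goal to its utility
    Ties are resolved in favor of the lower pair.
    """
    best_goal, best_gap = None, float("-inf")
    for lower, higher in zip(goals, goals[1:]):
        gap = utility[higher] - utility[lower]
        if gap > best_gap:
            best_goal, best_gap = higher, gap
    return best_goal


# Hypothetical utilities for a student who aspires to a B.
grades = ["F", "D", "C", "B", "A"]
u = {"F": -9.0, "D": -7.0, "C": -4.0, "B": 3.0, "A": 5.0}
print(level_of_aspiration(grades, u))  # B
```

Only the ordering of the differences matters to the result, so the same routine would serve if the differences were known only up to their rank.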

It may be seen from this definition 
that the difference in utility between 
achieving at the level of aspiration or 
achieving at the next lower level is 
greater than is the difference in utility 
between achieving at the level of as- 
piration or achieving at the next higher 
level. In other words, the level of 
aspiration is that goal which has the
largest distance (difference) in utility
between it and the next lower goal.

FIG. 3. Graphical representation of the utility
of grades for an individual whose level of
aspiration is “B.”

Figure 3 demonstrates this graphically.
The curve represents the utility function
of a student who aspired to a B
and who made the choices discussed
above, among others. It may be seen
that the difference between the utility
of B, U(B), and the utility of C, U(C),
is larger than any of the other differences.
The ordered metric utility function
may also be represented as it is
shown in Fig. 4.


LEVEL OF ASPIRATION AS A PROBLEM 
IN MEASUREMENT 


The preceding discussion is directed 
to the point that under certain condi- 
tions the task of ascertaining an indi- 
vidual’s level of aspiration reduces to 
the task of measuring his utility of the 
goals in question. That is, when an 
achievement variable is involved, the 
level of aspiration on the achievement 
scale may be derived by ascertaining 
the individual’s utility function for the 
goals in question. 

How strong must the measurement be? 
Obviously the weaker sorts of measure- 
ment—nominal and ordinal scaling— 
will not be satisfactory, for such scaling 
yields no information about the dis- 
tances between the entities on the scale. 
Since the identification of level of as- 
piration requires some knowledge of the 
distances on the scale, utility must be 
measured on a stronger scale than an 
ordinal one. 

The strength of measurement which 
is required depends on the nature of 
the variate—on whether the possible 
achievement goals are represented by a 
continuous variate or are a discrete set 
of entities with an underlying con- 
tinuum. 


If the possible goals are on a con- 
tinuum and if the achievement scale is 
represented by a continuous variate, i.e., 
if the scores (goals) may have any value 
in a certain range, then at least inter- 
val measurement of utility is necessary 
to derive level of aspiration. In inter- 
val measurement, the distances between 
points (i.e., the intervals) are known 
numerically (22, 26, 27), and thus the 
numerically largest distance may be 
identified and the level of aspiration 
thereby located. 

If the possible goals are discontinu- 
ous and if the achievement scale consists 
of a discrete set of entities, i.e., if the
scores (goals) can take only isolated 
values, then at least ordered metric 
measurement is necessary to derive level 
of aspiration. In strength, an ordered 
metric scale falls between an ordinal 
scale and an interval scale. An ordered 
metric scale (4, 5, 21) not only gives a 
ranking of the entities involved but also 
a ranking of the distances between the 
entities. With an ordered metric scale 
we can identify the largest distance and, 
inasmuch as the individual’s choice is 
limited to the discrete set of goals avail- 
able, we thereby find that goal which 
best represents his level of aspiration of 
those available. 

The problem of obtaining an interval 
measure of utility has been dealt with 
by Davidson, Siegel, and Suppes (7). 
Their paper includes an axiomatization 
of the interval measurement of utility 
as well as a report of an experiment in 
which interval scales of utility were suc- 
cessfully derived for 15 subjects. 

A method for obtaining an ordered 
metric measure of utility has been de- 
veloped by Siegel (21). Examples of 
applications of this method in tests of 
various hypotheses are given in (15), 
(23), and (24). 

Most level-of-aspiration situations, if 
not all, involve a discrete set of goals. 
Even if there is continuity underlying 
the achievement scale, as is the case
in throwing darts at a
board with a bull’s-eye, the possible
goals are usually broken up into a discrete
set. In the case of the dart board,
these are concentric circles around the 
bull’s-eye. Thus it would seem that for 
most situations measurement in an or- 
dered metric scale is all that is neces- 
sary to derive an individual’s level of 
aspiration. 

To test the validity of the statement 
that the level of aspiration is that goal 
which is the least upper bound of the 
largest distance on an ordered metric 
scale of utility, a study was conducted 
which is reported fully elsewhere (1). 
A summary report of it is included here 
to lend greater clarity to some of the 
topics discussed. 


EXPERIMENTAL VERIFICATION 


Becker and Siegel (1) have conducted 
a study wherein they tested the hy- 
pothesis that the ordered metric utility 
function would yield information on the 
individual’s level of aspiration. 

Subjects were 20 students enrolled in 
an elementary course in statistics. These 
were students who volunteered to gam- 
ble with the instructor for their mid- 
term grade, with the understanding that 
the grade they obtained in the gambling 
session would be entered in the course 
records in lieu of their score on the 
regularly scheduled midterm examina- 
tion. An ordered metric scale of the 
grades A, B, C, D, and F was derived 
for each subject by the method described 
elsewhere (1, 21). This method derives 
the subject’s scale from his choices 
among alternatives like those exemplified
above. 


3 After all data had been collected, the nature
of the experiment was explained to all
participants in individual interviews with 
them, and all agreed to take the examination 
and receive a grade based on their perform- 
ance on it. 
The subjects were required to make
choices between alternatives in a num- 
ber of offers. Each of the offers was 
numbered, and the subjects understood 
in advance that the offer on which the 
pay-off (midterm grade) would be based 
would be selected at random from all of 
the offers given. Therefore each deci- 
sion they made had an equal likelihood 
of being the crucial one on which their 
midterm grade depended (cf. 21, p. 214; 
20, p. 29). 

After the subjects had made all the 
required choices, two ruses were in- 
troduced. First, the allegedly random 
device by which each student’s grade 
would be determined was actually con- 
trolled so that each subject “won” a C 
for his grade. Some subjects responded 
to this pay-off with considerable dismay 
and disappointment. In response to this, 
the experimenters perpetrated the second 
ruse. The group of subjects was told 
that if they were dissatisfied with the 
grade they had won, they could obtain 
an interview with the instructor and the 
other experimenter, in which perhaps 
some way of raising their grade could 
be worked out. The implication con- 
tained in this announcement was that 
performance of extra work would be the 
mode of raising the grade. The subjects 
were told that any who desired such an 
interview would have to wait until the 
instructor placed an urgent long-dis- 
tance telephone call, after which he and 
the other experimenter would return to 
conduct individual interviews with those 
students who desired them. The in- 
structor said he would return in about 
five minutes, but actually his absence 
was unexplainedly extended to an hour’s 
time before he returned for individual 
interviews. As will become clear, this 
ruse permitted a test of one of the hy- 
potheses of the study. 

The interviews were structured to ob- 
tain a careful independent measure of 
each subject’s level of aspiration for 
his midterm grade. Both experimenters
were present, and their independent 
judgments of each subject’s level of as- 
piration were in very close agreement 
($r_s$ = .99). The interviews were conducted
before either experimenter had
any information concerning any sub- 
ject’s ordered metric scale of utility for 
grades. 

The experiment was designed to test 
two hypotheses. For this summary it 
may suffice to state these briefly and to 
report the outcome of each hypothesis 
test. 

Hypothesis I was that those subjects 
who would not wait for an interview 
with the instructor would be persons on 
whose ordered metric scales the largest 
distance was between D and F—persons 
whose ordered metric scales would re- 
veal that their level of aspiration was 
below the C which they had won. This 
hypothesis was supported by the data. 
Four subjects left the room before the
experimenters returned from their hour-long
errand. All had ordered metric
scales with the largest distance between 
D and F. Seemingly they were satisfied 
with the C grade they had been assured 
and therefore saw no reason to seek an 
interview. 

Hypothesis II was that subjects’ lev- 
els of aspiration, as determined from 
their ordered metric scales of utility, 
would be positively correlated with their 
levels of aspiration as judged from the 
interview material. This hypothesis was 
confirmed: the correlation between the 
two independent indices of level of aspiration
was $r_s$ = .83.
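
On the assumption that the coefficients reported above are rank-order (Spearman) correlations, the computation can be sketched as follows. The data are invented solely to show the procedure and are not the data of the study; grades are coded F = 0 through A = 4, and tied observations receive the mean of the ranks they span.

```python
def average_ranks(values):
    """Rank values from 1 upward, giving tied values their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of observations tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions are 0-based, ranks 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Rank-order correlation: the product-moment correlation of the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(x)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    vx = sum((a - mx) ** 2 for a in rx)
    vy = sum((b - my) ** 2 for b in ry)
    return cov / (vx * vy) ** 0.5


# Hypothetical levels of aspiration for six subjects (F = 0, ..., A = 4).
from_utility_scale = [3, 3, 2, 4, 1, 3]
from_interview = [3, 4, 2, 4, 1, 2]
print(round(spearman(from_utility_scale, from_interview), 2))
```

The routine is undefined when either list is constant, since the ranks then have no variance.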


GOALS FOR FUTURE RESEARCH


Child and Whiting (3, p. 508) have 
formulated five general statements which 
contain the conclusions which may rea- 
sonably be drawn from research on level 
of aspiration to date: 


1. Success generally leads to a raising of the 
level of aspiration, and failure to a lowering. 

2. The stronger the success, the greater is
the probability of a rise in level of aspiration; 
the stronger the failure, the greater is the prob- 
ability of a lowering. 

3. Shifts in level of aspiration are in part a 
function of changes in the subject’s confidence 
in his ability to attain goals. 

4. Failure is more likely than success to lead 
to withdrawal in the form of avoiding of set- 
ting a level of aspiration. 

5. Effects of failure on level of aspiration 
are more varied than those of success. 


Each of these generalizations may be
treated as a hypothesis to be tested in experimentation
employing the decision-theory
approach outlined here. If such
experimentation leads to confirmation of 
the hypotheses, this would strengthen 
confidence in the validity of that ap- 
proach. 

Simon (25, p. 110) points out that in 
most models of rational choice it is as- 
sumed that the individual will evaluate 
the outcomes of all possible alternatives 
before he makes any choice. In realis- 
tic human situations requiring decision, 
however, alternatives often must be ex- 
amined sequentially and decisions must 
be made as the alternatives are pre- 
sented. When this is the case, and when 
the individual does not know what vari- 
ables determine the order of the offers 
(and therefore does not know whether 
the future alternatives are likely to be 
“better” or “worse” than those with 
which he has already been confronted), 
he may well choose the first satisfactory 
alternative which is offered. In the 
terms of the present paper, the indi- 
vidual may choose the first alternative 
containing an outcome at or above his 
level of aspiration. The techniques sug- 
gested in this paper could be used to 
provide an empirical test of this hy- 
pothesis. 
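
The rule of choosing the first satisfactory alternative can be stated as a short procedure. The sketch below is one possible reading of it, assuming that the offers arrive one at a time and that each offer is a list of possible outcomes; the function name and the sample offers are hypothetical.

```python
def first_satisfactory(offers, rank, aspiration):
    """Return the index of the first offer containing an outcome at or
    above the level of aspiration, or None if no offer qualifies.

    offers:     sequence of alternatives, each a list of possible outcomes
    rank:       mapping from outcome to its position on the achievement scale
    aspiration: the outcome taken as the level of aspiration
    """
    for index, outcomes in enumerate(offers):
        if any(rank[o] >= rank[aspiration] for o in outcomes):
            return index
    return None


rank = {"F": 0, "D": 1, "C": 2, "B": 3, "A": 4}
offers = [["C", "D"], ["C"], ["A", "F"], ["B"]]
print(first_satisfactory(offers, rank, "B"))  # 2
```

Under this reading the offer of an A or an F is accepted even though a later offer guarantees a B, which is the kind of prediction an experiment could check.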

Simon offers another interesting hy- 
pothesis which might be tested in the 
context of the present approach. As the 
individual is being presented with a se- 
quence of choice situations, he may find 
that it is easy for him to discover satisfactory
alternatives among them. If so,
his level of aspiration will rise. On the 
other hand, he may find it difficult to 
discover satisfactory alternatives among 
those made available to him, and in this 
case his level of aspiration will fall. 
Simon suggests that such changes in 
level of aspiration would tend to guaran- 
tee the existence of satisfactory solu- 
tions to the decision situation, for fail- 
ure to discover initially satisfying al- 
ternatives would depress the level of 
aspiration and thereby bring satisfac- 
tory solutions into existence. 


SUMMARY AND CONCLUSIONS 


This paper suggests that Lewinian 
theory concerning level of aspiration 
may be integrated with certain parts of 
decision theory. An achievement scale 
may be viewed as a scale of utility of 
the achievement goals. A formal defini- 
tion of level of aspiration in terms of 
utility is offered. The problem of ascer- 
taining a person’s level of aspiration re- 
duces to the problem of measuring his 
utility of the achievement goals. It is 
hypothesized that level of aspiration is 
associated with the largest distance on 
an individual’s utility scale. If this is 
so, with a discrete set of goals, ordered 
metric measurement is sufficient for 
identifying a person’s level of aspiration, 
since an ordered metric scale contains 
not only a ranking of the entities 
(achievement goals) but also a ranking 
of the distances between them. Cer- 
tain experimental evidence which sup- 
ports the hypothesis is summarized. 
Suggestions for future research are pre- 
sented; these draw upon the ideas pre- 
sented here, together with those of other 
workers in the fields of level of aspira- 
tion and decision theory. 

In conclusion, it would seem that a 
useful behavioral model of decision mak- 
ing should include not only the concepts 
of utility and subjective probability, as 
do the present models, but should also
include a formulation of the effects of 
level of aspiration and reinforcement on 
utility. That is, the model should in- 
clude recognition that utility has a 
model in its own right, in which the 
main concepts are level of aspiration 
(LA) and reinforcement effects (R). 
In terms of such an extended model, 
it may be said that if various alterna- 
tives are available to an individual, he 
will choose from among these alterna- 
tives, toward each of which he has a 
subjective probability of attainment and 
a utility, so as to maximize subjectively 
expected utility, SEU. That is, the in- 
dividual will choose so as to maximize 


$$\mathrm{SEU} = \sum_i p_i u_i,$$


where

$$u = f(LA, R).$$


A NOTE ON THE CRESPI EFFECT


A. C. PEREBOOM 1 
Texas Technological College 


The classical Crespi (3) and the Tol- 
man and Honzik (6) studies report 
abrupt shifts in performance following 
a discrete shift in the magnitude of the 
reward. The writer recently attempted 
to reconcile their data with his own 
where this “abruptness” effect failed to 
occur (4). This led to certain infer- 
ences about the role of exploratory ac- 
tivity in performance measures. In ad- 
dition, Hull’s habit construct was re- 
vised to improve its handling of the 
learning process following such a shift 
in reward magnitude. The present paper
attempts to extend that study specifically
to include Crespi’s contrast
effect.

It was argued in that paper that the
early trials in most learning experiments
involve much exploratory behavior which 
competes, performance-wise, with the one 
response measure that concerns E, that 
based on the acquisition of a goal ex- 
pectancy. Learning about the maze or 
runway takes time, however, and knowl- 
edge about the goal forms only a small 
portion of the total learning. The poor 
performance in the early trials is a measure
of this large amount of learning going
on as it is reflected in exploratory
behavior. Thus, even the runway does 
not give us a single response tendency 
(2). But as trials continue, exploration 
drops out and the competing goal re- 
sponse becomes more and more domi- 
nant: performance is now approaching 
an asymptote. 

At this point in the Crespi experi- 
ment the reward magnitude was changed. 
All that his rats now had to learn was 
that a new magnitude was present; runway
exploration, reflected in gradual improvement,
was essentially completed
during the initial learning. The response
change was therefore abrupt relative
to the original rate of response improvement.


1 The writer is indebted to Dr. Keith J.
Hayes for his critical reading of this paper.


Now consider the contrast effect. The 
level of runway familiarity or learning 
at the time of the reward-shift will be a 
function of the previous amount of run- 
way exploration, and this in turn will 
be partially determined by the magni- 
tude of the initial incentive. A large re- 
ward will compete more effectively with 
the exploratory tendency than a small 
one. This is inferred from the greater 
running speeds found under large re- 
wards. Thus a large-reward group knows 
less about the runway than a small-re- 
ward group (cf. [1]). But unfamiliar- 
ity is a condition that arouses the ex- 
ploratory tendency. If the large-reward 
group is now put on a smaller incentive 
they will not only begin “extinguishing” 
to some extent but they will also begin 
exploring—and to a greater degree than 
would a group on this small incentive 
throughout training, simply because they 
know less about this runway than would 
such a control group. The same reason- 
ing should lead to the “elation” effect 
for the upward-shift group. An initi- 
ally small reward means a large amount 
of initial exploration. Under a new and 
larger amount, they will need to explore 
less than would a group under that 
larger amount throughout training; they 
can therefore run faster than this group. 

But the upward contrast effect does 
not always appear. Negative results 
were obtained in Spence’s laboratory in 
several experiments designed along the 
lines of the Crespi study (5, pp. 127-133). 
Spence concludes that the “elation” 
effect is due to a failure to train 
animals to their asymptotic speeds, ad- 
ditional trials being the source of the 
effect rather than the incentive shift. 
However, it may not be the asymptote 
per se. If the asymptotic speed is too 
close to some relative upper limit set by 
the given experimental situation, drive 
level, and quality of reward, there sim- 
ply will not be enough room for the ef- 
fect to show. To obtain the contrast 
effect then, the large reward must not 
be too large. Logically, of course, the 
same reasoning should apply to the 
downward effect. If the small reward 
leads to speeds little better than the 
operant speed, we would also fail to ob- 
serve any effect. 

A preliminary test of the present in- 
terpretation will introduce one condition 
into the Crespi study that was appar- 
ently omitted: Thoroughly familiarize 
the animals with the runway prior to 
Trial 1. This should lead to greater 
rates of response improvement under the 
initial incentives, and so tend to elimi- 
nate the relative abruptness of perform- 
ance changes following the incentive 
shifts. In addition, the asymptotes of 
the original performance curves should 
be greater than those obtained by groups 
lacking preliminary exploration: An up- 
ward contrast effect would already be 
present. Whether positive and negative 
contrast effects will appear following the 
incentive shifts, however, depends upon 
(a) the amount of preliminary exploration 
and (b) how long such groups are 
maintained under their initial amounts 
of reward. If no differences in runway 
familiarity are present at the time of 
such incentive shifts, then no contrast 
effects should follow.² 
2 It might be noted here that Crespi’s use 
of a particularly long runway (20 ft.) upon 
which speed was recorded, rather than just 
latency, no doubt favored the exploratory 
tendency and thus gave him clear-cut results 
(cf. [7]). 